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Abstract 

This paper contains a review of the main results of a search of regular- 
ities in collective variables properties in multiparticle dynamics, regu- 
larities which can be considered as manifestations of the original sim- 
plicity suggested by QCD. The method is based on a continuous dialog 
between experiment and theory. The paper follows the development 
of this research line, from its beginnings in the seventies to the current 
state of the art, discussing how it produced both sound interpretations 
of the most relevant experimental facts and intriguing perspectives for 
new physics signals in the TeV energy domain. 
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1 INTRODUCTION 



The structure of the vacuum and confinement are still unsolved problems of Quan- 
tum Chromodynamics (QCD) after many years from its introduction as the theory 
of strong interactions. Sound experimental informations in order to approach the 
two problems can come from hadronic spectrum and multiparticle production data. 
Attention in the present work is focused on multiparticle production and concerns 
mainly collective variables properties of final charged particles in full phase- space 
and restricted rapidity intervals, i.e., of collective variables properties in those awk- 
ward sectors where perturbative QCD is hardly applicable. Guiding line is the con- 
viction that complex structures which we observe at final hadron level might very 
well be, at the origin of their evolution, elementary and have simple properties. 
These characteristics in the detected observables are revealed by the occurrence of 
regularities which are expected to contain signals of the original simplicity and to 
be expressed in terms of the minimum number of physical parameters. 

This research line, along the years, has been inspiring and quite successful for a 
phenomenological description, based on essentials of QCD, of the main experimen- 
tal facts in multiparticle dynamics. This paper contains a summary of the results of 
this endeavour, which might be quite stimulating in the approach to the TeV energy 
domain in pp and heavy ion collisions, and to the determination of their possible 
substructures. 

Multiparticle production has quite a long story and its understanding is indeed 
crucial for strong interaction. The first observation of such events goes back to cos- 
mic ray physics in the thirties of the past century: the extraordinary and impressive 
fact had been the non-linearity of the phenomenon. 

This unusual experimental observation attracted the attention of many theorists 
in the forties and early fifties: in particular, the work of E. Fermi on the thermody- 
namical model [jjZb and of L. Landau [2] on the hydrodynamical model should be 
mentioned. Of course the contributions by J.F. Carlson and J.R. Oppenheimer 
H.J. Bhabha and W. Heitler fl], W.H. Furry [5], H.W. Lewis, J.R. Oppenheimer, 
S.A. Wouthuysen [6], together with the pioneering work by N. Arley [0] should 
not be forgotten. Particular aspects of the new experimental fact were described, 
but the situation was considered not satisfactory from a theoretical point of view. 
It was only W. Heisenberg who understood that multiparticle production should be 
described in terms of a non-linear field theory of a new nuclear force (which we call 
today indeed strong interaction) [S]. 

With the incoming of the multi-peripheral model ^\\, an important step was done 
in the understanding of the cm. energy dependence of the average charged parti- 
cle multiplicity in high energy collisions in terms of a logarithmic function, a trend 
competitive with the square root rule proposed earlier on purely phenomenological 
grounds [10[| ; n charged particle multiplicity distribution (MD), P n , was predicted 
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to be Poissonian, when plotted vs. n, suggesting an independent particle production 
process. Few years later (in 1966), P.K. MacKeown and A.W. Wolfendale noticed, 
in cosmic ray experiments, remarkable violations in the dynamical mechanism for 
independent particle production, by observing quite large fluctuations of the pio- 
nization component in hadron showers originated by primary hadrons at different 
primary energies [11]. They proposed to fit high energy cosmic ray data on charged 
particle multiplicity distributions (MD's) in terms of a Negative Binomial (Pascal) 
multiplicity distribution [from now on abbreviated as NB(Pascal)MD]. This phe- 
nomenological distribution is in fact characterised by an extra parameter in addition 
to the average charged multiplicity n, i.e., the parameter k which is linked to the 
dispersion D: k = n 2 / (D 2 — n). D 2 > n implies indeed deviations from the Pois- 
sonian behaviour of the n-particle MD predicted by the multi-peripheral model to 
which the NB (Pascal) MD reduces for k — > oo. The Authors gave also a sound 
phenomenological expression for the energy dependence of k^ 1 , showing that it is 
a finite number which increases with the increasing of the energy of the primary 
hadron toward an asymptotic constant value (A; -1 ~ 0.4). 

The discovery in the accelerator region, in the seventies, of the violations of 
multi-peripheral model predictions on n charged particle multiplicity distribution in 
high energy hadron-hadron collisions [12] confirmed the cosmic ray physics find- 
ings. The parallel success of the NB (Pascal) MD in describing in full phase-space 
in the accelerator region n charged particle multiplicity distributionsat different p\ ab 
in various collisions (53 experiments were successfully fitted [131 , 14] led to guess 
that the distribution was a good candidate for representing multi-peripheral model 
prediction violations. Although a germane attempt to justify the occurrence of the 
distribution in terms of the so called generalised multi-peripheral bootstrap model 
[15] was quickly forgotten, its phenomenological interest remained in the field: the 
distribution was rediscovered in full phase-space for non-single diffractive events 
and extended to (pseudo)-rapidity intervals by UA5 Collaboration [16] at CERN 
pp Collider cm. energies, and then successfully used by NA 22 Collaboration [17] 
at y/s = 22 GeV in pp and 7r ± p collisions and by HRS experiment [ 18 ] in e + e~ 
annihilation at s/s = 29 GeV in order to describe P n vs. n behaviour both in full 
phase-space and in symmetric (pseudo)rapidity intervals. 

The last two experiments were of great importance: the first one established a 
low energy point in hadron-hadron collisions, the second one extended to a new 
class of collisions the interest for the NB (Pascal) MD. In comparing NA22 with 
UA5 data it was found that n increases and k parameter decreases in full phase- 
space as the cm. energy becomes larger, whereas at fixed cm. energy n and k 
become larger with the increase of (pseudo)rapidity interval. ISR, TASSO and EMC 
collaborations data on P n vs. n followed within a short time and confirmed the 
success of their description by means of NB (Pascal) MD [19l , |20| , 21] . The fact 
that so many experiments in a so wide energy range and in symmetric (pseudo)- 
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rapidity interval could be fitted by the same n charged particle MD was considered 
not accidental. The impression was that one was facing an approximate universal 
regularity. 

A suggestive interpretation of the regularity was then proposed: it implied that 
the dynamical mechanism controlling multiparticle production in high energy col- 
lisions is a two-step process. To the independent (Poissonian) production of groups 
of ancestor particles (called "clan ancestors") follows their decay according to a 
(logarithmic) hadron shower process (the particle MD within each clan). Clans 
are by definition independently produced and, by assumption, exhaust all existing 
particle correlations within each clan (the "clan" concept was introduced in high 
energy physics at the XVII International Symposium on Multiparticle Dynamics 
[122]). This interpretation gave a sound qualitative description of different classes of 
collisions in terms of the two new parameters, the average number of clans, N, and 
the average number of particles per clan, n c , two non-trivial functions of standard 
NB (Pascal) MD parameters n and k (the study of N and n c constitutes what is 
known as clan structure analysis, described in more detail in Section ^21) . 

It turned out that within this analysis the average number of clans was larger in 
e + e~ annihilation than in pp collisions, but the average number of particles per clan 
was smaller in e + e~ annihilation than in pp. The situation was intermediate between 
the last two in deep inelastic scattering (the average number of clans was smaller 
than in e + e~ annihilation and similar to the average number in pp collisions, but the 
average number of particle per clan was more numerous than in e + e~ annihilation.) 
Larger clans in pp collisions with respect to those in e + e~ annihilation were inter- 
preted as an indication of a stronger colour exchange mechanism in the initial state 
of the collision in hadron-hadron collisions with respect to e + e~ annihilation. The 
advent of QCD as the theory of of strong interactions — the non-linear field theory 
foreseen by W. Heisenberg — raised a new question, i.e., how to reconcile observed 
final n charged particle MD's in various collisions, all described by NB (Pascal) 
MD's, with QCD expectations for final parton MD's. 

The problem was considered quite challenging and intriguing in view of the fact 
that, in solving QCD Konishi-Ukawa-Veneziano (KUV) parton differential evolu- 
tion equations in the leading-log approximation with a fixed cut-off regularization 
prescription, jets initiated by a quark and a gluon were found to be QCD Markov 
branching processes controlled by quark bremsstrahlung and gluon self-interaction 
QCD vertices, and final parton multiplicity distributions were in both cases found to 
be again NB (Pascal) MD's [123], 24]. In order to solve the puzzle, in consideration 
of the lack of explicit QCD calculations at final parton level, the suggestion was 
to rely on Monte Carlo calculations based on Dokshitzer-Gribov-Lipatov-Altarelli- 
Parisi (DGLAP) integral QCD evolution equations (the integral version of the KUV 
differential QCD evolution equations) and on a convenient guesswork as hadroniza- 
tion prescription. It was found [251 , 26 ], by using the Jetset 7.2 Monte Carlo, that 
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NB (Pascal) MD occurred both at final hadron and parton level multiplicity distri- 
butions for qq and gg systems at various cm. energies and in symmetric rapidity 
intervals. Results at parton and hadron levels turned out not to be independent and 
their relation summarised in the so called 'generalised' local hadron(h) - parton(p) 
duality (GLPHD), i.e., k h = k p and n h = pn p . The name 'generalised' came from 
the need to distinguish the present 'strong' result from the standard 'weak' local 
hadron parton duality (LPHD) which requested in its first version that only the sec- 
ond of the two equations be satisfied. Others unexpected regularities emerged when 
Monte Carlo results were analysed in terms of clan structure analysis |[25|,|26(]. 

Apparently the situation was quite well settled, but as it very often happens in 
the continuous dialog between theory and experiment (the main characteristic of 
the field), new experimental facts were just around the corner ready to question 
the universality of the regularity, i.e., the statement that final n charged particle 
multiplicity distributions in all classes of collisions both in full phase-space (FPS) 
and in symmetric (pseudo)-rapidity intervals are NB (Pascal) MD's. Experiments at 
top CERN pp Collider energy (UA5 Collaboration, [1271]) and in e + e~ annihilation 
at LEP energies (Delphi Collaboration [|28|, 29]) showed a shoulder structure in 
P n vs. n plots for the total sample of events both in FPS and in larger (pseudo)- 
rapidity intervals. In conclusion, the regularity was violated as the cm. energy of the 
collision increased. Since the regularity, in view also of the quite large experimental 
errors, has been always considered to be true in an approximate sense, a better 
analysis of the existing data, together with the new data on the shoulder structure of 
P n vs. n plot, led to guess that these new facts were signals of substructures arising 
as the cm. energy increased. 

Then an amusing situation happened: the regularities in terms of NB (Pascal) 
MD could be restored at a more fundamental level of investigation, i.e., at the level 
of the different classes of events contributing to the total sample (for instance the 
classes of soft and semi-hard events in pp collisions and the classes of the two- and 
the three-jet sample of events in e + e~ annihilation) [30. l3lL |32[|. 

The result was that the weighted superposition mechanism of different classes 
of events (each one described by a NB (Pascal) MD with characteristic parameters 
n and k) in different collisions allowed to describe quite successfully, in addition to 
the shoulder effect in P n vs. n, also the observed H n vs. n oscillations (where H n 
is the ratio of n-factorial cumulants, K n , to n-factorial moments F n and the trun- 
cation effect had been properly taken into account [32]) and the general behaviour 
of the energy dependence of the strength of forward-backward (FB) multiplicity 
correlations (MC's) Q. 

Finally, extrapolations of collective variables for the different classes of events 
based on our knowledge of the GeV energy domain became possible in the TeV 
region accessible at LHC. Expected data on pp collisions at LHC will test these 
predictions. 
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Clan concept emerges as fundamental in this new context and its relevance in the 
theoretical interpretation of the above mentioned results leads to the conclusion that 
it would not be too bold to ask to experiment and theory respectively the following 
two questions: 

Are clans real physical observable quantities? 

What is the counterpart of clans at parton level in QCD? 

In addition, in our approach, signals of new physics at LHC in restricted rapidity 
intervals are foreseen in the total sample of events for P n vs. n (elbow structure) 
as a consequence of the reduction of the average number of clan to few units in an 
eventual third class of events (described by a NB (Pascal) MD with k < 1). The 
new class should be added to the soft and semi-hard ones. All together, the above 
mentioned facts form what has been called "the enigma of multiparticle dynamics", 
which we would like to disentangle in this paper. 

The last word is once more to experimental observation, of course, and is chal- 
lenging for QCD! The problem in multiparticle phenomenology can be summarised 
in fact in the following simple terms. 

A rigorous application of QCD is limited to hard interactions only. They occur 
at very small distances and large momentum transfers among quarks and gluons (the 
elementary constituents of the hadrons), and in very short times. Perturbative QCD 
can be applied with no limitations in these regions. A situation which should be 
contrasted with the fact that the majority of strong interactions are soft; they occur in 
relatively large time interval and distances among constituents, and large transverse 
momenta. The search should be focused on how to build a bridge between hard and 
soft interactions, and between soft interactions and the hadronization mechanism, 
i.e., on how to explore regions not accessible to perturbative QCD, like those with 
high parton densities. Along this line of thought it is compulsory to isolate in these 
extreme regions the properties of the sub-structures or components or classes of 
events contributing to the total sample of events, being aware of the fact that the only 
informations we can rely on in a collision — as already pointed out — are coming 
from the hadronic spectrum and multiparticle production. The present work (limited 
to multiparticle dynamics) has been motivated by the need to review experimental 
facts and theoretical ideas in the field, ideas which, after more than thirty years 
from their introduction in the accelerator region, are still with us today and arouse 
a special interest Q34L [35], [36], \37\ \. 

The experimental facts we are referring to are indeed very important in estab- 
lishing collective variables behaviour in full phase-space and restricted rapidity in- 
tervals and represent the natural starting point for the investigations of the new 
horizon, opened at CERN and RHIC, in the TeV domain for hadron-hadron and 
nucleus-nucleus collisions. 

On the theory side, the ideas we propose to examine are still quite stimulating 
and matter of debate in view of both their success in describing, with good approxi- 
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mation, important aspects of multiparticle phenomenology of high energy collisions 
and of their connection with QCD parton showers. The search started a long time 
ago and took advantage of the encouragement and great interest of many scientists; 
of particular relevance has been the contribution of Leon Van Hove who, we are 
sure, would be very glad to see how far some ideas elaborated together went. 

In Section^ in order to enlarge the scope of our work from a phenomenological 
point of view, we decided to discuss the main statistical properties of the collective 
variables of the class of MD's to which NB (Pascal) MD belongs, i.e., the class 
of Compound Poisson Multiplicity Distributions (CPMD). Within this framework 
particular attention is paid to clan concept and its generalisations in view of its rel- 
evance in our approach to multiparticle phenomenology. Clan structure properties 
are always exemplified in the case of the NB (Pascal) MD. 

In Section \3\ QCD roots of NB (Pascal) MD are examined by studying jets at 
parton level as QCD Markov branching processes in the framework of QCD KUV 
equations. Then an attempt is presented to build a model of parton cascading based 
on essentials of QCD (gluon self-interaction) in a correct kinematical framework. 
This study led us to the generalised simplified parton shower model (GSPS) with 
two parameters. 

In Section 31 global properties of collective variables in multiparticle dynamics 
in various experiments are discussed and their descriptions and interpretations in 
terms of clan structure analysis are presented. Attention is focused especially on n 
charged particle MD's general behaviour in FPS and in (pseudo)-rapidity intervals 
in pp collisions and e + e~ annihilation, on the oscillation of the ratio of n-order fac- 
torial cumulants to factorial moments when plotted vs. n and on forward-backward 
multiplicity correlations. This study is extended in pp collisions to different scenar- 
ios in the TeV energy domain and has been obtained by extrapolating our knowl- 
edge of the collisions in the GeV region. It is shown that the weighted superposition 
mechanism of different classes of events, each one described by a NB (Pascal) MD 
provides a satisfactory description of all above mentioned, more subtle aspects of 
multiparticle phenomenology. Clan structure analysis turns out to be quite impor- 
tant in this respect. Its success raises a natural question on the physical properties of 
clans. New perspectives opened by the reduction of the average number of clans in 
the semi-hard component of the scenarios with Koba-Nielsen-Olesen (KNO) scal- 
ing violation are examined. Signals of new physics in pp collisions in the TeV 
region (ancillary to those expected for heavy ion collisions) are discussed at the 
end. 



7 



2 COLLECTIVE VARIABLES AND CLAN STRUCTURE ANALYSIS 

In studying and testing perturbative QCD, two complementary approaches can be 
used: one can select infrared-safe quantities in order to test pQCD in the hard re- 
gion, where calculations are well defined and non-perturbative corrections are sup- 
pressed. Alternatively, one can examine infrared- sensitive observables with the aim 
of testing the validity of the theory in the long-range region, where confinement 
becomes dominant. It could be said that the latter approach determines the bound- 
ary conditions that the former one must satisfy. The present work takes the road 
of the infrared-sensitive observables, which are discussed in detail in this Section 



2.1 Observables in multiparticle production. The collective variables. 
In a n-particle production process in the collision of particles a and b 

a + 6— ►cH + Cn, (1) 

by assuming for simplicity that all produced particles c$ are of the same species, 
the n-particle exclusive distribution P' n {yi, ■ ■ ■ , y n ) m a sub-domain of phase-space 
(the jji with i — 1 . . . n are the particle (pseudo)-rapidities) is described in terms of 
exclusive cross sections, i.e., 

d n n 

P^y 1 ,...,y n ) = (a msi y 1 - n — (2) 

dy 1 ... dy n 

and is fully symmetric in its variables. P^(yi, . . . , y n ) is related to the probability of 
detecting n particles at rapidity variables j/i, . . . , y n with no other particle present 
in the subdomain. 

The corresponding integrated observable in the rapidity interval Ay, (i.e., one 
integrates over yi E Ay, i = 1, . . . , n) describes the probability P n (Ay) of detecting 
n particles in the mentioned interval and is the n-particle exclusive cross section, 
o n , normalised to the total inelastic cross section, ai ne i: 

P n (Ay) = ^ = (n\y 1 [ ... [ P' n {y l ,...,y n )dy l ---dy n . (3) 

^inel >^Aj/ ^ Aj/ 

The n-particle inclusive distribution, Q n {yi, ■ ■ ■ , yn), in the reaction 

a + b — > c x + h c n + anything (4) 

describes the finding of n particles at y±, . . . , y n , without paying attention to other 
particles in the same sub-domain, in terms of the n-particle inclusive cross-section 
density, i.e., 

d n a 

Qn(Vl, ■ ■ ■ , yn) = (o-inel)" 1 ^ 1~ (5) 

dyi ■ ■ ■ dy n 
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(sometimes referred to in the literature as p(yi, ■ ■ ■ , y n )-) 
The corresponding integrated observables in Ay, 

F n (Ay) = ... j Q n (y 1} y n )dyi ■ ■ ■ dy n , (6) 

J Ay J Ay 

are the un-normalised n-factorial moments of the multiplicity distribution in Ay 
(the so-called binomial moments). For instance the average multiplicity in Ay, 

n(Ay), is 

F l (Ay)=n(Ay)= f Qi{y x )dy x =f ^-d Vl (7) 

J Ay J Ay "2/l 

In the integration over the domain Ay each particle will contribute once to the n- 
particle event and the event will be counted n times. Coming to the second order 
factorial moment, F 2 (Ay) — {(n — l)n), each ordered couple of final particles will 
be counted once in one event and the event will be counted 2 (™) times. 

The n-particle inclusive distributions Q n (yi, • • • , Un) contain inessential contri- 
butions due to combinations of inclusive distributions of lower order. In addition, 
for large n, it is hard to measure all Q n (yi, . . . , y n ) and a more essential informa- 
tion is demanded. Accordingly, a new set of observables is introduced, the set of 
n-particle correlation functions C n (yi, y n ) which reminds of cluster-expansion 
in statistical mechanics: 

Qi{yi)=C 1 {y 1 ) 

^2 (j/1,2/2) = 62(2/1,2/2) + ^1 (2/l)Cl(2/2) 

Qsiyi, 2/2, 2/3) = 63(2/1, 2/2, 2/3) + C 1 (y 1 )C 2 (y 2 , 2/3)+ 

61(2/2)62(2/3,2/1) + 6i(y 3 )C 2 (2/i,2/2)+ 



(8) 



The reverse is of course always possible and one can express C n (yi, . . . , y n ) 
variables in terms of Q n {vii ■ ■ ■ , Un) variables: the n-particle correlation functions 
C n (yi, ... , 2/n) give precious informations on the production process: 

n 

6„(2/i, . . . ,2/n) = Q n (yi,...,y n ) = Y[Qi(.Vi) ( 9 ) 



8=1 



Produced particles are Poissonianly distributed. 



C n (y u ...,y n ) >0 (10) 
Produced particles follow a distribution wider than a Poisson distribution. 

C n (y u ...,y n )<0 (11) 
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Produced particles follow a distribution narrower than a Poisson distribution. 

The corresponding integrated observable are called factorial cumulants of the 
multiplicity distribution and indicated in the literature with K n . They give infor- 
mations on the shape of the multiplicity distribution when plotted vs. n, i.e., on its 
dispersion ((n — n) 2 ), skewness ((n — n) 3 ), kurtosis ((n — n) 4 ), 

Both F n and K n are sensitive to events with many particles and control the tail 
of the multiplicity distribution. In K n are subtracted the Q n -i, •••> Qi correlations 
present in Q n , not those related to the fluctuations in the multiplicity, as lucidly 
discussed in [[41 J. 

Usually normalised differential collective variables 

Qn = Qn(yi, ■ ■ -,yn)/Qi(yi) ■ ..Qi(y n ), (12) 

C n = C n ( yi , . . . , y^/CM . . . C^) (13) 
and the corresponding normalised integral collective variables: 

F n = F n (Ay)/F 1 (Ay) . . . F^Ay), (14) 



K n = K n {Ay)/K 1 {Ay) . . . K^Ay) (15) 

are used. 

In the literature also n-particle combinants W n (Ay) are defined. Their expres- 
sion is a function of n-particle multiplicity distribution P n (Ay) according to the 
following recurrence relation: 

^A,) = ^-i|>^)^M, (16) 
P Q {Ay) Po{Ay) 

i.e., W n are "finite combination" ratios of P n to Pq. The name combinants comes 
from this fact. 

Combinants in their normalised form, W n are 

w (\ \ ^n(Ay) 

Wn{Ay) = W^yjT- ( ?) 

Notice that combinants are linked to factorial cumulants K n i.e. 

w „ (Ay) = ^ (-D'WA.) (18) 

i=0 
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Of particular interest is the relation which allows to express P (Ay) in terms of 
the combinants: 



P (Ay) = exp 



71=1 



(19) 



From the definition and properties of combinants it turns out that they are sen- 
sitive to the head of the multiplicity distribution whereas factorial cumulants and 
moments are sensitive to the tail of the multiplicity distribution. 

The relation among collective differential variables P' n {y\ , . . . , y n ) , Q n (yi , . . . , y n ) 
and C n (yi, . . . , y n ) can be elegantly reformulated in terms of functional, i.e., of 
G[z(y)] with z(y) = 1 + u(y) and u(y) an arbitrary function of y. Within this 
framework, G[z(y)} is defined as follows 

G[z(y)]=P + J2( n] -r 1 / ••• / P' n {yi,---,yn)\{z{yi)dy i , (20) 

n=l J i=l 

i.e., the (exclusive) generating functional. It should be noticed that 

OO p ,. 

P = l-J2(n ] .r 1 ■■■ P^y 1 ,...,y n )dy 1 ...dy n (21) 

n =l "'Aj/i J Ay„ 

is the probability of events with zero particle multiplicity. 

The knowledge of the functional G[z(y)} allows to define two other functionals 
G[u{y)} and g[u(z)}. They are linked by the relation 

G[u(y)}=exp{g[u(y)}}, (22) 

where 

G[u(y)] = l + J2( n] -y 1 / ••• / Qn(yu...,y n )l[u(y l )dy l , (23) 

n=l ^ A ^™ i=l 

is the (inclusive) generating functional, and 

oo p p n 

9Hy)] = ^N)- 1 / . . . / C n ( yi , . . . ,y n ) Y[u( yi )d yi . (24) 

n=l ^ A ?/i J A y™ i=l 

It is remarkable that, in view of the existing connection among collective dif- 
ferential variables P' n , Q n , C n and their integrated partners P n , F n , K n , the gen- 
erating functions (GF's) of the integrated variables, obtained by the substitution 

z(y) — > z, are G(z), G(z + 1) and \ogG(z + 1) respectively. Accordingly, being 
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G{Z) = En P nZ U ^ follows 



Pr, = ^ 



1 d n G(z) 



n\ dz- 
d n G{z) 



z=0 



Bz n 

u4> z=l 

d n \ogG(z 



dz r 



(25) 
(26) 
(27) 



z = l 



In addition one gets 



G(z) =exp^— jl = JJexp 



n=l 



n=l 



IV. 



(28) 



Coming to multiplicity combinants W n (Ay) their relation with the exclusive 
generating function G(z; Ay) should be recalled: 



G(z; Ay) = exp 



£W»(A|/)Ci 



71=1 



(29) 



This relation shows that combinants are the coefficients of the power series expan- 
sion of the logarithm of the generating function. 

The above general definitions will now be exemplified for the NB (Pascal) MD; 
explicit expressions are given below, and also plotted in Fig. CD for typical values of 
the parameters (corresponding to the soft and semi-hard components which describe 
pp collisions data at 546 GeV cm. energy [42[| .) 

Being the GF of the NB (Pascal) MD 



Gnb(z) 



k 



k — (z — l)n 



(30) 



(with k~ l = (D 2 - n) /n 2 , D is the dispersion) it is found from Eq.s (gS), £26), (g7) 
that 

(nb) = k(k + l)---(k + n-l) n n k k 

n\ {n + k) k + n ' 



n-l 



JJf B > = n»n(l + «'A). 



(31) 
(32) 



i=l 



and 



n-l 



Al NB) = fi n 



\Ji/k. 



(33) 
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In the literature also the collective variables expressed in terms of the ratio of 
factorial cumulants and moments [[431], i.e., 

H n = K n /F n , (34) 

deserve great attention. For the NB (Pascal) MD one gets 

n-l 

H (NB) = TT - (35) 

Accordingly, -£^ NB) depends on k parameter only, an important fact which will be 
of particular interest in Section 13.31 

2.2 Clan concept and Compound Poisson Multiplicity Distributions. 

Clan concept, as already mentioned, has been introduced in order to interpret the 
wide occurrence of the NB (Pascal) MD in multiparticle phenomenology in differ- 
ent classes of collisions since 1987. In terms of the NB (Pascal) MD generating 
function, G NB (ri, k; z) = Yl^Lo^n z n , one has 

GNB(n, k; z) = exp {N[Gi og (n, k; z) - 1]} , (36) 

where N = fcln(l + n/k) is the average number of clans, n the average number 
of particles of the total NB (Pascal) MD and k the second characteristic parameter 
linked to the dispersion D by (D 2 — n)/n 2 — 1/k; the GF of the logarithmic MD, 

Gio g (n, k] z), is given by 

_ ln(n + k — nz) — ln(n + k) 
log = lnfc-ln(n + jfe) ' { ) 

and the average number of particles per clan, n c , turns out to be 
Notice that 

N = n/n c . (39) 

The interest of Eq. ([361 ) lies in the fact that it is indeed related to an important 
dynamical concept: the description of particle production as a two-step process. To 
an initial phase in which ancestor particles are independently produced (their mul- 
tiplicity distribution is Poissonian as predicted for instance by the multi-peripheral 
model) it follows a second phase in which Poissonianly distributed particles (the an- 
cestors) decay according to a given new multiplicity distribution (for instance, the 
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Figure 1: General behaviour of global variables P n , K n , F n , H n , W n vs. n for the NB (Pascal) 
MD for two sets of parameters (the pairs of chosen values correspond respectively to the soft and 
semi-hard components which describe pp collisions data at 546 GeV cm. energy as determined in 

&) 
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logarithmic distribution given by Eq. (137).) An ancestor particle in this framework 
is very special: it is an intermediate particle source and together with its descendants 
is called in the literature 'clan' ('Sippe' in German language; see also the Oxford 
Dictionary entry). Each clan contains at least one particle (its ancestor, the interme- 
diate particle source) by assumption, and all correlations among generated particles 
are exhausted within the clan itself by definition. The point is that G(z) in Eq. (36) is 
just by inspection the GF of a compound Poisson multiplicity distribution (CPMD), 
a much wider class of multiplicity distributions than the NB (Pascal) MD [ 144 , 45] , 
Accordingly, the first step of the production process corresponding to independent 
emission of the particle sources is Poissonian for all the multiplicity distributions 
belonging to the class of CPMD: what fully characterises final n-particle multiplic- 
ity distribution in this class are the multiplicity distributions of particles originated 
by the Poissonianly distributed particle sources. Therefore we propose to gener- 
alise clan concept and its properties in terms of collective variables to the full class 
of compound Poisson multiplicity distributions. The generalisation is interesting 
since it gives a larger horizon to multiparticle phenomenology in suggesting that 
particle multiplicity distributions need not to be in general of logarithmic type as in 
the case of the NB (Pascal) MD: in principle any true GF is allowed. 1 

In order to generalise clan concept to the class of CPMD, we suggest therefore 
to write the GF of a generic CPMD, Gcpmd(^), as follows Q46 ] 



Gcpmd(^) = exp {N g [g(z) - 1]} , (40) 

with Gcpmd(^) = Ylin P%~ PMD) z n , g(z) is here the generic GF of the multiplicity 
distribution of particles produced by the generalised clan ancestor and N g is the 
average number of generalised clans. The total number n of particles distributed 
among the N g generalised clans satisfies of course the condition Yl!i=i n c = n, 

(i) 

with n c the number of particles within the i-th generalised clan. The name "gener- 
alised clans" (g-clans) will be given to clans when the GF g(z) in the above equation 
is related to a generic multiplicity distribution g n , i.e. g(z) = ^ n g n z n , whereas 
the name "clans" refer properly to the grouping of particles defined within the NB 
(Pascal) MD, for which g(z) is the logarithmic distribution. 

It is clear that since particles within each clan must contain at least one particle 
(the clan ancestor) particle distribution within each clan must satisfy the condition 



0(*)|*=o = 0o = O. (41) 

'it should be pointed out that the generating function G(z) (and the corresponding MD P n ) 
belongs to the class of Infinitely Divisible Distributions (IDD) GF's if for every integer number s 
there exist s independent random variables with the same GF g s (z) such that G(z) = [g s (z)] s . It is 
remarkable that s can be in general a positive non integer value and that all GF's of discrete IDD's 
can be written as compound Poisson distribution GF's [[460. 
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The description in full phase- space can be extended to restricted regions of 
phase-space, i.e., for instance to the set of rapidity intervals Ay. One has 

G C pmd(2, Ay) = exp {N g . clan (Ay)[g(z, Ay) - 1]} , (42) 

and, contrary to Eq. ( 141) , g(z, Ay)\ z=a = go(Ay) is different from 0; g(z, Ay)\ z=0 
is the probability that a g-clan does not generates any particle in Ay. 

In order to fulfill the request that at least one particle per clan will be produced 
in the interval Ay a new renormalised GF g(z, Ay) should be defined 

1 - g {Ay) 



Accordingly, 



It follows 

and 



G CP md(z, Ay) = exp {N g . clan (Ay)[g(z) - 1]} 

In (A Mz,Ay)-l \ (44) 

N g . clan (Ay) = N g . clan [l - g (Ay)} (45) 



g(z,Ay)\ z=0 = 0. (46) 

It should be stressed that knowing the probability of generating zero particles in 
the interval Ay, the average number of generalised clans in the same interval can be 
determined from the knowledge of the average number of generalised clans in FPS 
(this will be used in Section 13.2.31 ) 

In conclusion, by considering only those clan which generates at least one par- 
ticle in the interval Ay the GF Gcpmd(z, Ay) as given by Eq. ( [441 ) satisfies the 
standard clan definition also in the case of generalised clans. 

Now we will discuss two interesting theorems that characterise the class of 
CPMD. 

i. A MD is a CPMD iff the probability of producing zero particles, P , is larger 
than zero and all its combinants W n are positive definite. The theorem follows from 
the fact that combinants W n are related to the n-particle multiplicity distribution 
within a g-clan, p n , by the equation 

W n = N g . clanPn , (47) 

and that the average number of g-clan iV g . clan is related to combinants W n by 

oo 

iV g . c l an = -logPo = ^Wn. (48) 

n=l 
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0471] . The connection between N g . C [ an or P and the combinants is particularly sug- 
gestive. 

ii. All factorial cumulants of a CPMD are positive definite. Being for a 
CPMD n-order factorial cumulants, K n , related to n-order factorial particle mo- 
ments within the g-clan, q n , by 

K n = N g . clan q n , (49) 

Pq > and iV g . clan a finite number, positive definiteness of n-order factorial mo- 
ments, q n , implies that also K n are positive definite. The theorem can be applied 
to H n variables, i.e., to the ratio of n-order factorial cumulants to n-order factorial 
moments K n /F n . The theorem is of particular interest in the study of H n vs. n 
oscillations [48J. 

The mentioned relations between the normalised n order cumulants of the total 
CPMD, K n , and the normalised n order factorial moments of the particle multiplic- 
ity distribution within the g-clan, q n , can be easily extended from full phase-space 
to a generic rapidity interval Ay as follows 

K n (Ay) = g n (Ay)/iV g . clan (Ay) (n - 1) . (50) 

Along this line multiplicity combinants of a CPMD W n (Ay) and n particle MD 
within the g-clan are related as in f.p.s . 

W n (Ay) = N g . clm (Ay)p n (Ay) : (51) 

with 

oo 

AWAy) = ^^ n (Ay). (52) 

n=l 

It should be pointed out that, if P = or one of the combinants W n or one of the 
factorial cumulants K n is less than 0, then the MD cannot be a CPMD. 



2.3 Hierarchical structure of factorial cumulants, rapidity gap events and CPMD 's 

From the discussion on collective variables properties in CPMD's, it has been seen 
that a special role is played by P (Ay), i.e., by the probability of detecting zero 
particles in the rapidity interval Ay. 

In fact, it is clear from results of Sec. \2.2\ that, thanks to the knowledge of 
P (Ay), the n-particle multiplicity distribution P n (Ay) can easily be determined 
according to the following iterative equation: 

P n (Ay) = (-nAyr^P (Ay), (53) 
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and that there exists an instructive connection of Po(Ay) with factorial cumulants 



P (Ay) = exp |^[-n(Ay)]"ij^ B (Ay)| . (54) 

where 

K n (Ay) = K n {Ay)/[K 1 {Ay)\ n . (55) 

Following the just mentioned relations, a new variable in terms of Po(Ay) can 
be defined which turns out to be of great importance in the study of n-particle cor- 
relation structure, i.e., the void function Vo(Ay), 

Vo(Ay) = -l/n(Ay) \ogP (Ay) 

n=l 

In fact it can be shown that when plotted as the function of n(Ay)K 2 (Ay) the 
void functionVo(Ay) scales iff n-order normalised factorial cumulants, K n (Ay), 
can be expressed in in terms of second-order normalised factorial cumulants, K 2 (Ay) , 
i-e., 

K n (Ay) = A^K^Ay)}^ 1 (57) 

The A n are energy and rapidity independent factors; they are determined by 
the correlation structure of n-particle MD. When Eq. (1571) is satisfied one talks of 
"hierarchical structure for cumulants." 

The properties of the void function variable can be easily generalised to the class 
of CPMD's. It turns out indeed that 

VVpMo(Ay) = l/n c , g . clan (Ay) = (58 ) 

^CPMD(Ay) 

being 

P CPMD (Ay) = exp[-iV g . clan (Ay)]. (59) 

In view of the connection between normalised cumulants, K n , and the nor- 
malised 72-particle correlation functions, C n , the hierarchical structure prescription 
in a symmetric rapidity interval Ay on K n can be translated in terms of C n variables 

C n (yi, ...,y n ;Ay) = ^2 A n>olt ^ ^{y^Vn] Ay) . . . C 2 (y in _ 1 ,y ln ; Ay) (60) 

at u 

where the sum over a denotes all non symmetric relabelings of the n particles; A nm 
are coefficients independent on the cm. energy and rapidity interval Ay; a t labels 
the different ways of connecting the n particles among themselves [49] . 
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star graphs: 




Figure 2: Graphical representation of correlation functions of different order (left column) in the 
hierarchical scheme. At order 4, the LPA and the VHA begin to differ: the LPA includes only the 
'snake' graphs, while the VHA includes also the 'star' graphs [50]. 
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A graphical description of the possible connections turns out to be quite use- 
ful. A two-particle normalised correlation function C 2 (yi,yj] Ay) is associated to 
an edge linking particles of rapidity yi and rapidity yj\ a graph with (n — 1) edges 
is generated for each C n (yx, . . . , y n \ Ay) configuration of distinct particles as de- 
scribed by Eq. ( [601 . The sum over a t should be read as the sum over all topolog- 
ically distinct connected n-graphs (see Figure [2). Among the different models of 
correlations functions let us mention two of them: 

i) the linked pair ansatz model (LPA) [51]. In this model a particle cannot appear 
more than twice in a product and a relabeling reduces to a standard permutation, in 
conclusion only a t = "snake" graphs are allowed (see Figure |2) 

Cn(yi,---,y n ;Ay)\ LPA = A„ >snake ^ C 2 (y h ,y i2 ;Ay) . . .C 2 (y in _ 1 ,y in ;Ay); 

permutations 

(61) 



ii) the Van Hove ansatz model (VHA) [52]. In this model, in addition to "snake' 



graphs, also graphs with a particle linked to three other particles are allowed, the so 
called "star" graphs (a t = "snake", "star") and Eq. ([60]) can be re-expressed with a 
recursion relation: 

C n+ i(yi, y n +i; Ay) | VH a = n[C n (yi, ...,y n ; Ay)C 2 (y u y n +i] Ay)] s , (62) 

where [...]s means symmetrization over all particles. 

In general, by integrating Eq. ([60] ) — in particular, both the LPA and VHA 
cases — over the central rapidity interval Ay, assuming translation invariance of 
the C [[40], one obtains Eq. ( [57]) , which defines hierarchical structure for cumulant 
moment; thus it is apparent that hierarchical structure for cumulants is not sufficient 
to discriminate among different ansatz for correlation functions. 

In order to test hierarchical structure for normalised cumulant moments, the void 
function Vo(Ay) can be used. Since 

oo . 

V (A y ) = -^[-n{Ay)K 2 {Ay)r\ (63) 

n=\ 

energy and rapidity dependence are confined here in the product n(Ay)K 2 (Ay), 
i.e., the void function scales for hierarchical cumulant moments with n(Ay)K 2 (Ay) . 
On the contrary, if scaling holds, i.e., V = V (n(Ay)K 2 (Ay)), Eq. ( 163]) follows 
with 

A, =„(-!)-' 



i\n(Ay)K 2 (Cy)}^ „ fe , ^ 

energy and rapidity independent. 

For a CPMD's, P (Ay) and V (Ay) variables are completely equivalent to 

N g (Ay) and 

^c,g-clan 

(Ay), being 

N g (Ay) = -\ogP (Ay) (65) 
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and 

^ c ,g-cian(Ay) = [Vo(Ay)]- 1 . (66) 
In case of the NB (Pascal) MD, 

K n (Ay)\ m = (n - l)\[k(Ay)}- n+1 , (67) 

being 

K 2 (Ay)\ NB = l/k(Ay) and A n = {n-l)\. (68) 
The scaling function V (Ay) turns out to be 



v <A^MM, og ( 1 + ^). 

A second important application of P (Ay) resides with jet production in hadron- 
hadron collisions: when jet production is associated with a colour- singlet exchange 
process, one expects a signature of rapidity gap events [53], while the production as- 
sociated with colour-octet exchange (i.e., the one which is the focus of this review) 



will produce a much lower rate of empty rapidity intervals [541] . The difference be- 
tween the two processes could lie in the different internal structure of clans [1551]: in 
the colour-octet exchange, clans decay with a logarithmic distribution, as discussed 
in the previous section; in the colour- singlet exchange, clans decay with a geometric 
distribution. In other words, in both cases Eq. (59) remains valid, but with different 
values of iV g . c i an . It was found that such a scheme can describe well the excess of 
low multiplicity events in the colour- singlet exchange process [55]. 



2.4 CPMD's, truncation and even-odd effects. 

The maximum number of observed particles in the n charged particle multiplicity 
distribution, P n , never exceed a given number n , which of course increases with 
the cm. energy of the collision. Theoretically, at least from energy-momentum 
conservation, this number is related to the available energy and the mass of the 
lightest charged particle, the pion. In experiments, there is in addition a practical 
limit (usually much lower than the theoretical one) related to the luminosity and 
total cross-section, i.e., dependent on the statistics. In order to guarantee that the 
probabilities sum to one, from the original (non-truncated) distribution P n a new 
MD is defined, P n , 

{AP n if n < nn 
n . f , (70) 

if n > n 

such that J2nLo Pn — 1- A is the normalisation factor. It follows from their def- 
inition that factorial moments F n are equal to zero for n > n , whereas factorial 
cumulants K n are different from zero for any n. It should be pointed out that a 
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Figure 3: General behaviour of global observables P„, K n , F n , H n and W n as in Fig. CD but for 

truncated NB (Pascal) MD's. 
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even-only multiplicity distribution 



normalised factorial moments 




solid line: rr = 24 k = 7 
dashed line: n = 47.6 k = 14 
even multiplicities only 



Figure 4: General behaviour of global observables P n , K n , F n and H n as in Fig. [Jbut for even-only 
NB (Pascal) MD's (combinants are not shown as they are not relevant in this context.) 
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truncated MD is not a CPMD and that at least one of its combinants W n is negative 
(for n > no.) Accordingly, factorial cumulants F n and the ratio of factorial cumu- 
lants to factorial moments, H n , of a truncated MD are not positive definite. Notice 
that combinants of the truncated MD are not affected by the truncation process for 
n < n (combinants are sensitive indeed to the head of the distribution and only 
to ratios P n / Pq.) The truncation process becomes relevant for factorial cumulants, 
K n , and for the ratio of factorial cumulants to factorial moments, H n . This fact 
reflects the peculiar property of K n and H n to be sensitive to the tail of the MD. As 
an illustrative example, in Fig. [3] we present the same observables as in Fig. U for 
the case of a truncated NB (Pascal) MD. Of particular interest are the zeros of the 
generating function of a truncated MD (the GF in this case is a polynomial of order 
n ). 

Another conservation law which has to be obeyed is charge conservation: in an- 
nihilation events, only even multiplicities can be produced in FPS, while in narrow 
intervals this restriction is negligible (the so-called even-odd effect). One must also 
in this case re-normalise the MD: 



so that J2^=o = 1- F° r completeness, in Fig. 33 we present the same observables 
as in FigJT] for a complete NB (Pascal) MD in which only even multiplicities are 
non-zero. 




(71) 
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3 COLLECTIVE VARIABLES AND QCD PARTON SHOWERS 



3. 1 Parton showers in leading log approximation 

Because our aim is to study the emission of soft gluons, fixed-order calculations 
in the coupling constant a s are insufficient and a resummation of perturbative dia- 
grams is needed. The standard approach [ 156 ] is to select the set of diagrams which 
dominate the perturbative series at high energies; since in the expansion one en- 
counters physical amplitudes proportional to a™(Q 2 ) log m (Q 2 ) with m < n, the 
leading terms are those in which m — n, which give the so-called leading log ap- 
proximation (LLA). 

In general, the factorisation theorem for collinear singularities allows to build 
a parton model description of the production process. For example, in the case of 
e + e~ annihilation into hadrons, the common wisdom suggests the following scheme 
(see Fig. |5): the electron and the positron annihilate into a virtual particle (a photon 
or a Z°) which then decays into a quark-antiquark pair; this part is governed by 
the electroweak interaction theory. Perturbative QCD (pQCD) then describes the 
emission of further partons (mostly gluons) until the virtualities involved become 
too small: in this "soft region" where perturbation theory cannot be applied the 
produced partons merge with very soft gluons to form hadrons, often large mass 
resonances which then decay according to standard model rules. 

Let us now go into more details: the single-inclusive cross section at hadron 
level a(e + e~ — > hX) can be expressed in terms of a perturbative elementary cross- 
section a p (e + e~ — > qiqi) for a hard process at scale Q 2 , and of a fragmentation 
function D^(z,Q 2 ) which is interpreted as the probability of finding a hadron h 
in the fragmentation of a parton of flavour i, carrying a fraction z of the parton 
momentum: 



where the sum runs over all flavours. Fragmentation functions, like structure func- 
tions in deep inelastic scattering, are universal and not calculable with perturbative 
methods. However, they can be expressed in terms of a Q 2 -dependent partonic 
fragmentation function, D\ (z, Q 2 , Q 2 ), and a universal hadronic function at a fixed 



Furthermore, the evolution with the hard scale can be calculated and is given by the 
Dokshitzer-Gribov-Lipatov-Altarelli-Parisi (DGLAP) equations: 




(72) 



(soft) scale Q s , 



(y,Qs), 




(73) 




(74) 
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B CD 



Figure 5: Schematic view of electron-proton annihilation into hadrons. (A) electroweak step (scale: 
10~ 17 cm); (B) hard processes, described by perturbative QCD (scale: 10 -15 cm); (C) soft non- 
perturbative processes (hadronization, scale: 10~ 13 cm); (D) decay of resonances into observable 

hadrons (mostly pions). 



where Pjk(z) is the DGLAP elementary kernel for the emission of parton k from 
parton j, with parton k carrying a fraction z of fs momentum. These kernels can 
be computed with elementary perturbative methods p6]. 

It is important to stress the simple physical meaning of DGLAP evolution equa- 
tions in LLA: in the axial gauge, this approximation corresponds to select dressed 
ladder diagrams without interference terms and with strong ordering in the virtu- 
ality and transverse momentum of the offspring partons. It is this feature which 
allows to interpret the production process in a probabilistic language, in terms of 
a branching process. In this language, the elementary DGLAP kernel P a b(z) gives 
the probability density for a parton a to emit a parton b which carries a momentum 
fraction z. 

Taking into account also the virtuality, we are led to a bi-dimensional elementary 
probability for the splitting of parton a producing parton b with momentum fraction 

z: 

a s (Q 2 )dQ 



2tt Q 



P ab (z)dz. (75) 



In order to normalise this probability, one has to re-sum all virtual corrections, i.e., 
in the parton shower language, to take into account the probability that no parton is 
emitted at virtualities larger than Q; such probability is given by the Sudakov form 
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factor: 



{f w n iK 2 \ f Zmax 



dK 2 
K 2 



(76) 



Here the integration limits z m i n and z max depend on the kinematics of the splitting: 
this means that in general they (and consequently the whole expression in brackets) 
depend on the virtualities of both offspring partons and not only on K: this makes 
a closed form for S a (Q\W) impossible to find, unless some simpler approximation 
(like the fixed cut-off introduced below) is used. 

Thanks to this probabilistic partonic picture, the result for single-inclusive dis- 
tributions can be generalised using the "jet-calculus" rules [241 ] for n-parton frag- 
mentation functions. 

It should be noticed that both collinear and infrared singularities are present in 
the LLA expression of DGLAP kernels: the former ones are avoided by imposing 
a soft cut-off on the evolution (Q s > 0). The simplest way of curing infrared 
divergences is to impose a fixed (i.e., virtuality independent) cut-off on z, say z m { n = 
e' = 1 — z max ; then one can simply interpret the integral of the regularized kernels 
as elementary splitting probabilities 

P g9 {z)dz = ^ = !h (77) 



l-e' 



Cf N 2 



P gq {z)dz = -f = (78) 

/ 1_£ ' N f 
B = N f P qg {z)dz = -f (79) 



withe = (-21ne') _1 . 

We will use the so-called "jet thickness" y as evolution variable: 

y= 2ri l ° g ) = 2^b l0g [ lo g (Q 2 /A 2 ) ) ' (80) 

from virtuality W down to Q, where b = (11N C — 2Nf)/l2ix and we used the 
leading order expression 

a ° (Q2) = b log(Q 2 /A 2 ) " (8l) 

Then the probability P q (Q\W)dQ that a quark of virtuality W splits at a virtuality 
in the range [Q,Q + dQ] (by emitting a gluon) is given by 

P q {Q\W)dQ = e~ Ay Ady, (82) 
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and the probability P g (Q\W)dQ that agluon splits (by either emitting another gluon 
or a quark-antiquark pair) by 

P g {Q\W)dQ = e~ {A+B)y {A + B)dy. (83) 

Neglecting conservation laws, the last two equations imply that the splitting proba- 
bility is constant for each dy interval: this allows to classify the process as Marko- 
vian, and therefore to write the appropriate forward and backward Kolmogorov 
equations for the probabilities to create n q quarks and n g gluons from an initial 
quark, P q (n q ,n g ; y), or an initial gluon, P g (n q ,n g ; y), at thickness y [23]. The 
corresponding non-zero transition probabilities in an infinitesimal interval dy are 
as follows: 

(n q , rig) — > (n q , n g ) = 1 — An g dy — An q dy — Bn g dy; 
(n q , rig) — > (n q , n g + 1) = An g dy + An q dy; (84) 
(n q ,n g ) -> (n q + 2,n g - 1) = Bn g dy. 

It is simpler to use the generating functions (GF's): 

G a (u, v; y)=J2 u n «v n °P a (n q , n g ; y) (85) 

n q ,n g 

where a = q,g. One obtains the following differential equations [H]: 

^ = A(Gl-G g ) + B(G 2 q -G g ), (86) 
^ = AG q (G 9 - 1). (87) 

At low energy, the production of quark-antiquark pairs is negligible, and one 
can study solutions setting B = 0: by looking only at the evolution of the number 
of gluons, the above equations decouple and the result is 

G g {u,v-y) = v[v + (1 - v)e Ay ]- 1 (88) 
G q (u, v; y) = u[v + (1 - v)e Ay ]- A ' A (89) 

The gluon multiplicity distribution (MD) in a gluon-initiated shower is found to 
be a shifted geometric distribution with average multiplicity e Ay while the gluon 
MD in a quark initiated shower is a NB (Pascal) MD with average multiplicity 
n = A(e Ay — I)/ A and parameter k = A/ A, which for ^-regularization is just 
(N% — 1)/2N 2 (i.e., 4/9 with 3 colours). Clans in the quark jet case can be defined 
as in Section [22] and we obtain 

e Ay - 1 

N = Ay; n c = ~^ r - (90) 
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The two vertices: gluon production from a quark, controlled by parameter A, and 
gluon emission from a gluon, controlled by parameter A, have been separated and 
found to correspond respectively to clan production and gluon showers inside clans, 
in contrast to the standard parameterisation n, k where, as shown, the two vertices 
are mixed. This result leads to approximate clans to QCD bremsstrahlung gluon 
jets; gluons inside a clan follow on average a logarithmic distribution, which in turn 
can be written as a weighted average of the geometric distribution given in Eq. (88) 
(see Appendix IA.41 ) 

To summarise, the essential conditions for the QCD interpretation of the oc- 
currence of NB (Pascal) MD here discussed are: i) the independent emission of 
bremsstrahlung gluons, ii) the dominance of the vertex g — > g + g over g — > q + q, 
and iii) weak effects of coherence and conservation laws. The weakness of this ap- 
proach is that it is very hard to introduce in the QCD Markov branching process the 
dependence on rapidity and therefore investigate MD's far from full phase-space. 
We discuss in the next section some solutions to this problem. 



3.2 The kinematics problem and possible answers. 

In the treatment of the previous section, correct kinematics has not been taken into 
account: for example, at each splitting, phase-space is limited, as one should make 
sure that in a — > b + c, the virtualities sum up correctly: Q a > Qb + Qc- Fur- 
thermore, energy and momentum conservation are not ensured. In order to address 
these problems, several methods have been developed: we will briefly mention the 
most popular of them, then discuss the simplified parton shower (SPS) and the gen- 
eralised simplified parton shower (GSPS) models, which, although overlooked by 
the common wisdom, deserve in our opinion particular attention as an attempt to 
work in a fully correct kinematical framework by using essential features of QCD. 



3.2.1 DLA and MLLA and Monte Carlo 

In order to include more kinematics in the shower evolution, perturbations theory 
can be improved [[5s [57], [580 • hi the Double Log Approximation (DLA), for ex- 
ample, one takes into account the phase-space for the emission of a soft gluon; 
however, recoil is not considered for the parent parton, i.e., energy is not conserved 
in each individual emission. This can be justified in a first approximation by the 
hypothesis that emitted gluons are required to be soft, which is certainly applicable 
at asymptotic energies. Notice that DLA still allows a probabilistic interpretation of 
parton production as a cascade. 

A noteworthy result in DLA is the prediction of Koba-Nielsen-Olesen (KNO) 
scaling [159] for the MD of gluons in a gluon jet; the high multiplicity tail is given 
by 

f(n/n) oc exp(— Cn/n), (91) 
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where C ~ 2.55, calculated in pQCD. This KNO scaling form does not depend on 
the running of a s , being insensitive to all details of the evolution but the branching 
structure of the process (a s of course appears in the energy dependence of n.) 

In order to bring into play also recoil effects, thus trying to solve the problem of 
the too prolific parton production of DLA, one arrives at the Modified Leading Log 
Approximation (MLLA) equations. Recoil effects modify the argument of evolu- 
tion equations, taking the energy lost in the creation of a new parton into account. 
The equations become then differential equations with retarded terms (difference- 
differential equations). The probabilistic interpretation of the branching is still re- 
tained. 

By considering only gluon production in a gluon-initiated jet, the DLA predic- 
tion has been corrected with pre- asymptotic terms, and the KNO scaling result is 
now violated at finite energies [160]. The new result for the tail is 

f(n/n) <xexp[-(C'n/n)^}, (92) 

where /i = (1 - 7)- 1 and C = CV(1 - 7 ) 1 ^/r(l + 7) and 7 = d\ogn/d\ogQ 
is the anomalous dimension. MLLA predictions are much closer to experimental 
data than DLA ones, but the overall description is still poor, especially near the 
maximum. The tail is still close to an exponential (like that of a NB (Pascal) MD). 

In addition to study the cm. energy dependence of the MD, much can be learned 
from a study of its moments as a function of the order, at fixed energy. It was indeed 
found in [43] that it is possible to obtain analytic formulae within MLLA, even with 
the addition of higher order terms in an expansion in 7, in the limit of frozen a s , for 
the ratio H q of factorial cumulants to factorial moments. Numerical solutions [ 143 ] 
with running a s have confirmed the oscillating behaviour of H q as a function of 
the order q found analytically for solutions to the QCD equations for the generating 
functions. See Fig. [61 Oscillations of this type have been found experimentally, 
but before comparing to the pQCD predictions one has to take into account the 
truncation of the tail in the data (which was shown also to produce comparable 
oscillations [16 ll]) and the effect of hadronization (actually, there is no effect when 
using generalised local parton-hadron duality, see Section [331 ) 

In all analytical calculations however, while energy conservation is taken into 
account, momentum conservation is handled only by the angular ordering prescrip- 
tion; one then resorts to numerical methods, usually in terms of a Monte Carlo sim- 
ulation of a parton shower: the evolution of partons can be traced step-by- step at 
each LLA vertex, imposing all conservation laws and required phase-space restric- 
tions. Furthermore, not only multiplicities, but every aspect of the reaction may be 
simulated, although usually at the cost of adding extra parameters. It is remarkable 
that excellent results have been obtained in e + e~ annihilation into hadrons but not 
in minimum-bias pp collisions, where general features like MD's are still poorly 
described. 
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Figure 6: Ratio of cumulant to factorial moments, H q , as function of the order q, computed in 
pQCD, using different approximations |62|: notice that 'oscillations' only appear in NNLA (dashed 

line). 

The Monte Carlo approach has been however particularly useful to obtain phe- 
nomenological ideas to be used in analytical calculations (see, e.g., Section [331 on 
hadronization.) 

3.2.2 The SPS model 

Another possibility to introduce kinematics constraints in QCD shower evolution 
is to isolate the fundamental features of pQCD on the basis of which a simplified 
analytical model can be developed in a correct kinematical framework. This is what 



has been attempted with the Simplified Parton Shower (SPS) model [163], described 
here in some detail. 

We consider an initial parton of maximum allowed virtuality W which splits at 
virtuality Q into two partons of virtuality Q and Q\. We require Q > Qo + Qi 
and Q , Qi > 1 GeV; this implies that any parton with virtuality less than 2 GeV 
cannot split further. We define the probability for a parton of virtuality W to split at 
Q, p(Q\W), which is normalised by a Sudakov form factor, as in Eq. ( [821) or ([831 . 
Since we are interested in the structure of the solution, we will discuss only one 
type of parton, introducing a free parameter which we call A, and use 

p(Q\ w ) d Q = Q 1 ^WW d Q = d {^w) • (93) 

The probability for an ancestor parton of maximum allowed virtuality W to generate 
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n final partons, P n {W), and the probability for a parton which splits at virtuality Q 
to generate n final partons, R n (Q), with P n (Q) = R n (Q) = 8 n i for Q < 2 GeV, 
lead to the following generating functions: 

oo 

f(z,W) = J2Pn(W)z n ~ 1 , (94) 

n=l 
oo 

^(^Q) = ^^n(Q)^- 2 , (95) 
n=2 

where the series has been shifted w.r.t. the standard definition in consideration of 
the fact that there is always at least one parton in the cascade; this definition will 
simplify some of the formulae to follow. The two generating functions are linked 
by 

p2 pW 

f(z,W)= / p(Q\W)dQ+ / p(Q\W)zg(z,Q)dQ. (96) 



The joint probability density V(QoQi\Q) for a parton of virtuality Q to split into 
two partons of virtuality Q and Q\ is defined by 

V{Q Q Qx\Q) = p(Q \Q)p(Qi\Q)K(Q)6{Q -Q - Q0, (97) 

where K(Q) is a normalisation factor and the conditions on the virtualities are 
shown explicitly. The dynamical content of the model in virtuality is expressed 
by the following equation 

/"OO /"OO 

Rn(Q) = V / dQ dQ 1 V(QoQl\Q)R n -rAQo)Rn'(Ql), (98) 

n>=l Jl Jl 

which gives the probability for a parton which splits at virtuality Q to generate n 
final partons in terms of the joint probability density, Eq. (971) , and of daughter 
parton's respective showers. 

Eq. (981) can be reformulated for the corresponding generating function by di- 
viding the domain of integration in three sub-domains; one obtains 

g(z, Q) = £ dQo £ dQ 1 V(QoQi\Q) 

/2 poo 
dQo J dQ{P{QoQi\Q)zg{z,Q l ) (99) 

/*OG /*00 

+ / dQo dQ 1 V(Q Qi\Q)zg(z,Qo)zg(z,Q 1 ). 

J2 J2 

The above three sub-domains correspond to the possible different situations in 
which the two generated partons can be found, i.e., neither of them splits, only 
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Figure 7: Kinematically allowed virtuality domain in the SPS model for a parton splitting at virtu- 
ality Q producing two offspring partons of virtualities Qo and Q±, with Qo + Qi < Q- he dashed 
line indicates the extension of phase space due to the violation of the virtuality conservation law; the 
dotted lines indicate the domain in which only one of the two offsprings can split further [390. 



one parton splits, or both partons split (see Figured) This general scheme is valid 
for any choice of splitting function p(Q\W), but in the present case this function 
factorizes and Eq. ([961) simplifies into the differential equation 



df(z,W) 
dW 



p{W\W)[zg(z,W)-f(z,W)]. 



(100) 



This equation can be easily solved in two extreme cases, obtained respectively 
by restricting or by relaxing phase- space constraints. 

In the first case we allow to generate only very soft partons: 9(Q — Qo — Q\) — > 
9(Q — Qo)9{2 — Qi). This results in bremsstrahlung-like emission and a Poisson 
distribution of generated partons: 



f(z,W) = e x ^ z - 1 \ 



where 



X(W) 



w floeW 
P (Q\Q)dQ = A log ' 



(101) 



(102) 



n vlog2 
gives the average multiplicity of generated partons (in addition to the initial ances- 
tor). 

The second case is obtained by decoupling the virtualities as in 9(Q — Q — 
Qi) 9{Q — Qo)9(Q — Qi). This is exactly the case examined for the LLA with 
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fixed cut-off, Eq. ( 188]) , and indeed the result is a geometric distribution 

f(z,W) = [l-(z-l)(e>M-l)]- 1 , (103) 

with average multiplicity of generated partons e x ^ w > — 1. 

Further analytical solutions of the SPS model were not possible, but numerical 
solutions using Monte Carlo methods show that the NB (Pascal) MD describes the 



MD's rather well [63], which is not surprising considering that the NB (Pascal) MD 
interpolates between the Poisson (1/k = 0) and the geometric (k = 1) distributions. 

Moreover, by differentiating the above equations for generating functions in 
z — 1, one can obtain equations for factorial moments which are linear in the mo- 
ments themselves, although they contain all moments of lower order. This feature, 
common to other evolution equations, is a consequence of the parton shower struc- 
ture. But because the SPS model implements a correct kinematical framework, at 
small virtualities the equation simplify and a recursive solution can be achieved. 
Such a solution has to be numerical, but it was found [[39] that the virtuality evo- 
lution is consistent with pQCD results and experimental observations for values of 
the parameter A between 1 .5 and 2. The behaviour of high order factorial moments 
qualitatively agrees with experimental data and with the most detailed pQCD cal- 
culations and is still consistent with NB (Pascal) predictions. 

For describing the rapidity structure of the model, we propose to use the singular 
part of the QCD kernel controlling gluon branching: 

dz 

p(yo\QoQiQy) oc P(z )dz oc — r (104) 

Ml - zo) 

Here z is the energy fraction carried away by the produced parton in the infinite 
momentum frame. 

The limits of variation of z are fixed by the exact kinematical relations 




B-ifi J - 7T <z <B+JB*- ^ , (105) 



where B — |[1 + (Qo/Q) 2 — (Qi/Q) 2 } is the scaled parton energy in the centre of 
mass system and B 2 — (Qo/Q) 2 its maximum scaled transverse momentum. In 
this way the scaled transverse momentum (w.r.t. the parent direction) of the parton 
with energy fraction z 



and its rapidity 



¥ = \r-(f) 2 - ( *- B)2 ao6) 



V» = V + \log—^— 007) 

Z LB — Zq 
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are uniquely determined. Rapidity of the second parton of virtuality Q\ is obtained 
by energy-momentum conservation: 



yi = y + tanh 

Notice that only the first step has to be treated differently in rapidity because it 
corresponds to the degrading from the maximum allowed virtuality W to the vir- 
tuality of the first splitting Q. In this case the rapidity of the ancestor is fixed by 
conservation laws and is given by 

y = tanh^ 1 \j 1 ~ ■ (109) 

Again an analytical solution was not possible, but good fits to the Monte Carlo 
implementation of the SPS model in rapidity intervals were obtained with the NB 
(Pascal) MD. 



B 



B 



-tanh | y - y\ 



(108) 



3.2.3 The GSPS model: generalised clans 

At this level of investigation, it should be clear that still open problems are the 
lack of the analytical solution of Eq. (1991) and, in more general terms, the lack of a 
complete analytical study of the parton evolution process in rapidity. 

In order to solve part of these problems, we proposed to incorporate in the SPS 
model the idea of clans; we called this version of the model "Generalised Simplified 
Parton Shower" (GSPS) model [164, 165J. Accordingly, we decided to pay attention 
for each event to the ancestor which, splitting n times, gives rise to n subprocesses 
(one at each splitting, see Fig. [8) and we identify them with clans. Therefore in 
this model for a single event the concept of clan at parton level is not a statistical 
one, as it was in the SPS model: in the present picture the clans are independent 
active parton sources and their number in each event coincides with the number of 
splittings of the ancestor, i.e., with the number of steps in the cascade. 

Notice that each clan generation is independent of previous history (it has no 
memory); thus the process is Markovian. Furthermore each generation process 
depends on the evolution variable only and is independent of the other variables of 
the process like the number of clans already present and their virtualities. In the 
original version of the SPS model, the splitting function of the first step, p(Qo\Q), 
was different from the splitting function of all the other steps, 0(Qo\Q), obtained 
by integrating the joint probability function V(Qo, Qx\Q), 

P{Qo\Q) = ^ ° dQ 1 V(Q ,Q l \Q). (110) 
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p(yo\QoQy)p(yi\QiQy) 



p(Q\W)S(y - tanh-yi - Q 2 /W 2 ) 
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Figure 8: The structure of the Generalised Simplified Parton Shower model (GSPS) for clan pro- 
duction in virtuality Q and rapidity y. The production process is decoupled at each splitting both in 
virtuality and in rapidity, see Eqs. (11 121 ) and (11 14) . Each blob represents a clan. Notice that here 

Qo <Q,Qi< Q, \vo -y\< logQ/Qo, \yi -y\< logQ/Qi S 



In order to generalise the SPS model as it stands we assume that virtuality con- 
servation law is locally violated (although conserved globally) according to 

l<Qo<<2, l<Qi<Q- (HI) 

The upper limit of integration of Eq. ( Ill 0] ) becomes Q, the normalisation factor 
K(Q) reduces to 1 and the process becomes homogeneous in the evolution variable 
since 

rQ 

P(Qo\Q) = p(Q \Q) J p{Qi\Q)dQi= P (Q \Q). (112) 

The approximation described by this equation, therefore, can be interpreted as the 
effect of local fluctuations in virtuality occurring at each clan emission. 

This violation of the virtuality conservation law spoils of course the validity of 
the energy-momentum conservation law, which, in the SPS model, uniquely deter- 
mines the rapidity of a produced parton, given its virtuality and the virtuality and 
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rapidity of its germane parton -see Eq. ( 1 108]) . In the GSPS model the two produced 
partons at each splitting are independent both in virtuality and in rapidity; however, 
the rapidity of each parton is bounded by the extension of phase-space fixed by its 
virtuality and the virtuality of the parent parton: 

\y% -y\< iogTf- O 13 ) 

In conclusion, by weakening locally conservation laws, we decouple the production 
process of partons at each splitting. Consequently, the GSPS model allows to follow 
just a branch of the splitting, since each splitting can be seen here as the product 
of two independent parton emissions. This consideration will be particularly useful 
in discussing the structure in rapidity of the model; in fact, it is implied that the 
DGLAP kernel given in Eq. ([104) should be identified with 

p(yo\QoQy)dy p(yi\QiQy)dy 1 oc ^^1. (114) 

Zq Zi 

Then one gets the multiplicity distribution p N (W): 

Pn(W) = e-^ [X ^ N iy \ N>0. (115) 

Eq. (1 151 ) is a shifted Poisson distribution in the number of clans N, with average 
number of clans given by: 

N(W) = iVe- W^r" 1 = x{w) + 1 (116) 

N=l ^ 

We stress that this result has been obtained a priori in the present generalised ver- 
sion of the model, differently from what has been done previously in [ 166 ] where the 
independent production of clans was introduced a posteriori in order to explain the 
occurrence of NB (Pascal) regularity. 

Since clans are by definition not correlated (i.e., only particles belonging to the 
same clan are correlated, while particles belonging to different clans are not) the 
MD of clans in rapidity intervals is obtained by binomial convolution 

p N (Ay,W) = ( N U N (l-7r) N '- N p N ,(W), (117) 

N'=N ^ ' 

where iv(Ay, W) is the probability that one clan is produced within the interval Ay 
by an ancestor parton of maximum virtuality W. The average number of clans in 
the interval Ay is therefore 

N(Ay,W) =7r(Ay,W)N(W). (118) 
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In terms of generating functions 

F clan (z;Ay) = [^(Ay,^) + l-7r(Ay,iy)]e^ A ^ A ^- 1 ). (119) 

Presenting the sum of two Poissonian distribution (the first a shifted one), this re- 
lations has a nice physical meaning: the two terms correspond to the probability 
of having the ancestor within or outside the given rapidity interval. When Ay is 
very small, n(Ay, W) tends to zero and the unshifted Poissonian dominates. Thus, 
it can be stated that in the smallest rapidity intervals the clan multiplicity is to a 
good approximation Poissonian and the full MD belongs to the class of Compound 
Poisson Distribution. n(Ay, W) tends to 1 for large Ay and the exact full phase- 
space shifted-Poisson distribution is approached. Finally, when ir(Ay, W)X(W) is 
sufficiently large, the shifted-Poissonian dominates at large N (the tail of the distri- 
bution) while the unshifted one dominates at small N (the head of the distribution). 
These facts might have some consequences in interpreting the anomalies found in 
NB behaviour for small N and the deviations from NB behaviour in large rapidity 
intervals, which are controlled by the behaviour of the distribution at large N. 

The calculation of N(Ay, W) can then be carried out analytically, although the 
procedure is very tedious. The key observation is that, thanks to the decoupling, the 
elementary splitting function for an ancestor parton of virtuality Q to produce a clan 
of virtuality Q x has the same functional form of the probability that the next splitting 
of the ancestor itself happens at virtuality Q . Therefore, the ancestor and the clan 
at the last step of the shower evolution can be interchanged. We will skip the details 
of the calculation, to be found in |64[], and proceed to illustrate the results. 

In Fig. [9] the clan density is shown as a function of the width of the interval 
Ay = [—y c , y c ] for A = 2 for maximum allowed virtualities W = 50 GeV, W = 
100 GeV and W = 500 GeV. The contribution of one-parton showers turns out to 
be negligible for this choice of A. Notice that the height of the distribution is de- 
creasing (simply because the full phase-space value increases only as a double log) 
and the width increasing with the energy (phase-space grows logarithmically). Con- 
volution of clan density for two back-to-back parton showers is shown in Fig. JOI 
Notice that the central dip at y ~ is slowly removed by increasing the energy 
of the initial parton. It should be kept in mind that the structure of Figures [9] and 
[10] refers to clan production; the different behaviour for parton production inferred 
from data is not in contradiction with this behaviour since we have still to include 
in our scheme parton production within a single clan. 

In Figure CCD the average number of clans N(Ay, W) is given as a function of 
rapidity width y c for the same W values of Figure [91 Limitations on the rapidity 
intervals are determined by the available phase-space corresponding to the different 
initial parton virtualities. 

Accordingly, the GSPS model predicts, for the average number of clans at parton 
level in a single shower (jet): 
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Figure 9: Clan density in rapidity, i.e., number of clans per unit rapidity, in a single parton shower, 
according to the GSPS model with parameter A = 2. The three curves refer to different initial 
virtualities (dotted line: W = 50 GeV; dashed line: W = 100 GeV; solid line: W = 500 GeV) and 
are normalised each to its own average number of clans in full phase-space |[64]j. 
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Figure 11: Average number of clans in a central rapidity interval [—y c ,y c } as a function of the 
interval's half-width y c for a single shower in the GSPS model with A = 2. The three curves refer 
to different initial virtualities (dotted line: W — 50 GeV; dashed line: W = 100 GeV; solid line: 

W = 500 GeV) S. 




Figure 12: The same curves as in Fig. [CD rescaled in the abscissa to the respective full phase-space 
(FPS) rapidity and in the ordinate to the respective FPS average number of clans, i.e., Eq. (11201 ) vs. 

y* = yc/ytps S 



40 



a) a rise with the rapidity width y c , for different initial partem virtualities W, 
which is very close to linear for 1 < y c < t/f ps ; the rise is still linear but with a 
somewhat different slope for y c < 1. A characteristic bending occurs finally 
for rapidity width y c < y fps ; 

b) approximate (within 5%) energy independence in a fixed rapidity interval y c 
for W below 100 GeV. For higher virtualities, deviations from energy inde- 
pendence become larger; they are within 20% when comparing N(y c , 50 GeV) 
and N(y c , 500 GeV). It should be noticed that the average number of clans 
slowly decreases with virtuality; this behaviour has been already observed in 



Monte Carlo simulations for single gluon jets (670 



In addition to the above results which are consistent with our expectations on 
clan properties in parton showers, the model shows energy independent behaviour 
(see Figure [12j by normalising the average number of clans produced in a fixed 
rapidity interval \y\ < y c to the corresponding average number in full phase-space, 
and by expressing this ratio as a function of the rescaled rapidity variable y* = 

yc/yf PS - 

vyc ' ; N(W) 

This new regularity turns out to be stable for different choices of the parameter A. 
In Figure Q2i a clean linear behaviour is shown for the above ratio corresponding to 
the parameter value A=2. 

Having completed the analytical treatment of the number of clans, we now pro- 
ceed to the final partons level. In order to study the average number of clans in the 
rapidity interval Ay, N(Ay, W), in the previous treatment we limited our discus- 
sion to the first step of parton shower evolution in the GSPS model. It is clear that if 
one wants to calculate the average number of partons per clan in the same interval, 
n c (Ay, W), one has to analyse the second step of parton shower evolution, i.e., to 
study the production of partons inside clans. In order to do that, inspired by the 
criterion of simplicity and previous findings, we decided to maintain inside a clan 
the structure of the model seen in the first step. The only difference lies in the intro- 
duction of a new parameter, a, controlling the length of the cascade inside a clan, 
in the expression of the probability that a parton of virtuality Q emits a daughter 
parton in the virtuality range [Q , Q + dQo], i.e., 

Pa(Qo\Q)dQo = d (t^) ■ (121) 

The GSPS model is thus a two-parameter model: A and a, controlling the length of 
the cascade in step 1 and 2 respectively. This is equivalent to introducing the SPS 
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structure for a single clan; however, by maintaining the decoupling in each splitting 
(a la DLA) one keeps the possibility to solve the model exactly. 
The MD in a clan of virtuality Q is therefore 

Q>2 GeV 



where 



M*, Q) = < - - - (122) 

n c (Q) = e x ^ , X a (Q) = J% a (Q'\Q f )dQ' ■ (123) 

The solutions correspond to a shifted geometric distribution (Q > 2 GeV) and to 
a clan with only one parton (Q < 2 GeV). The bound 2 GeV is a consequence of 
the fact that in the GSPS model the virtuality cut-off is fixed at 1 GeV (a parton 
with virtuality Q < 2 GeV cannot split any further byas sumption). Notice that 
this finding agrees with the clan model discussed in [168], where the logarithmic 
MD for partons inside average clans is interpreted as the result of an average over 
geometrically distributed single clans of different multiplicity, i.e., initial virtuality 
(See also Appendix IA.41 ) 

We now calculate the generating function of partons MD in a rapidity interval 
Ay inside a clan of virtuality Q and rapidity y: this is done through a binomial 
convolution on the corresponding generating function in full phase-space: 



l+(z-l)n a {Ay,Q,y) 



Q>2 GeV 



gAv(z,Q,y) = < 1 -^»(Av.Q.»)(fic(Q)-i)(i-*) - ( 12 4) 
" \l + (z-l)n a (Ay,Q,y) Q < 2 GeV 

where n a (Ay, Q, y) is the probability that a clan of initial virtuality Q and rapidity 
y produces a daughter parton inside the interval Ay. Notice that this approximation 
neglects rapidity correlations among particles in the same clan. However, it allows 
exact analytical solutions (although long and cumbersome to handle). The average 
number of partons per clan is then 

n c (A yi Q,y) = ir a (A yi Q,y)e x ^ Q \ (125) 

When the above equation is averaged over the probability that a parton of maxi- 
mum allowed virtuality W produces a clan of virtuality Q and rapidity y, we obtain 
the average number of partons in an average clan generated in a shower of virtuality 
W. As probability over which we average, one can use the bi-dimensional clan den- 
sity in virtuality and rapidity normalised by the average number of clans. Results of 
the calculations of the average number of particles per clan as a function of the ra- 
pidity interval Ay and of the maximum allowed virtuality W with A = 2 and a = 1 
are shown in Fig. M for W= 50 GeV (solid line), W=\00 GeV (dashed line) and 
W= 500 GeV (dotted line). The trend fully coincide with the behaviour of clans 
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Figure 13: Average number of partons per clan in a central rapidity interval [— y c , y c ] as a function 
of the interval's half-width y c for a single shower in the GSPS model with A = 2, a = 1. The three 
curves refer to different initial virtualities (dotted line: W = 50 GeV; dashed line: W = 100 GeV; 

solid line: W = 500 GeV) & 

structure parameters obtained by analysing quark and gluon jets MD's [67[|. The 
result of the analytical calculation of the average number of partons in the shower, 
n(Ay, W) is shown in Figure Q3 with the same parameters. The average parton 
multiplicity grows almost linearly with rapidity for relatively small Ay intervals 
and then it is slowly bending for Ay intervals approaching f.p.s., where it reaches 
its maximum. It is interesting to remark that the normalised average number of 
partons in the shower, n(Ay, W)/n(fps, W) scales in virtuality as a function of the 
rescaled rapidity interval, Ay/fps, see Figure CES This scaling in W is found to 
depend on the parameter a, as different values of a give different scaling curves, 
differently from the scaling found for for N(Ay, W) / N(fps, W) which is indepen- 
dent of the mechanism inside clans. 

In conclusion, the GSPS model is a parton shower model which was built assum- 
ing QCD-inspired dependence of the splitting functions in virtuality and in rapidity, 
with Sudakov form factors for their normalisation; in addition to these ingredients 
(which were called "essentials of QCD"), the characterising feature was introduced 
of distinguishing explicitly the two steps of clan production and subsequent decay, 
allowing at each step in the cascade local violations of energy-momentum conser- 
vation laws but requiring its global validity. This model was found to have an im- 
portant predictive power in regions not accessible to pQCD. Analytical calculations 




43 




Figure 14: Average number of partons in a central rapidity interval [— y c , y c ] as a function of the 
interval's half-width y c for a single shower in the GSPS model with A — 2, a = 1, The three curves 
refer to different initial virtualities (dotted line: W — 50 GeV; dashed line: W = 100 GeV; solid 

line: W = 500 GeV) @]. 
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Figure 15: Same curves as in Fig. [14} rescaled in abscissa to the respective FPS rapidity, and in 
ordinate to the respective average FPS multiplicity |] 65[| . 
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of the virtuality and rapidity dependence of the average number of clans and of the 
average number of particles per clan have been carried out. Results are consistent 
with what is known on clan properties in single quark and gluon jets, disentangled 
at hadron level by using jet finding algorithms and analysed at parton level assum- 
ing that generalised local parton-hadron duality (discussed in the next Section) is 
applicable. 



3.3 Hadronization prescriptions 

Having performed computations at parton level, the problem arises on how to make 
the connection with the measured hadron level. Such a process is of course non- 
perturbatiye in nature, and usually approached through various models: the string 



model [[69] and the cluster model [ 170] are widely used in Monte Carlo calculations; 
statistical hadronisation models [ 171 1 ] are now starting to know considerable success. 

It has been noticed however that many predictions of perturbation theory can 
reproduce experimental results down to low virtuality scales, and often give the 
correct energy evolution except for an overall normalisation. This behaviour can 
be expressed in terms of pre-confinement: the perturbative evolution is continued 
to low virtualities while partons rearrange themselves in their evolution to form 
colour singlet clusters which hadronize subsequently at a soft scale of the order of 
the perturbative cut-off to the shower. This picture lead to the Local Parton-Hadron 
Duality (LPHD, 'weak' duality) prescription: single-particle inclusive distributions 
at hadron level are taken proportional to the corresponding distribution at parton 
level: 

Qf\y) = pQ i ?\y), (126) 

where in general p ps 2. It is a way to investigate to what extent pQCD can di- 
rectly reproduce experimental data up to a rescaling factor. In particular, integrating 
Eq. (11261) one obtains for the average multiplicities: 

# = pn {p) . (127) 



In [1261], it was noticed in that Monte Carlo calculations of MD's, the NB (Pascal) 
MD provided a good fit both at parton and hadron level, with the same parameter 
k. This lead to the formulation of Generalised LPHD (GLPHD; 'strong' duality) 
which brings into play higher order inclusive distributions: 

QM(y 1 ,...,y n ) = p n Q ( *\y 1 ,...,y n ). (128) 

In general, one expects p > 1 . Integrating over rapidity, one obtains the proportion- 
ality of (un-normalised) factorial moments: 

F^=p n F^; (129) 
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recalling Eq. ( [26]) which links factorial moments with generating functions, one 
obtains the relation: 

G {h \z) = G {p \l -p + pz). (130) 

This relation leaves unchanged all the observables that do not depend on the aver- 
age multiplicity. It can easily be seen, e.g., that normalised factorial moments are 
the same at parton and hadron level, being = pF± . The same is true for 
normalised factorial cumulant moments, since they can be expressed as a sum over 
factorial moments (recall the cluster expansion that links inclusive distributions to 
correlation functions, Eq. (§). Of course, it follows that also the ratio K n jF n = H n 
is invariant under the GLPHD transformation: 

K^=p n K^; (131) 
H W = K W /F W = Kto/F® = H$\ (132) 

When applying Eq. (1130) to MD's which are infinitely divisible at parton level, 
one still obtains (barring pathological cases) distributions which are CPMD's: 

G {h) {z) = exp{iV (p V P) (l -p + pz)- 1]}. (133) 

However, at z = one obtains g^ p \l — p) ^ 0, which violates the condition that 
there are no empty clans. One has to redefine the MD within clans [72,1731]: 

q iH) {z) = 9^(1- P + pz)- 9^(1- P) 

9 [Z) l_^(p)(l_p) • UJ4j 

This is equivalent to the following transformation on the clan parameters: 

ft(h) = ft(p) ^ _ g ( P )^ _ p )j ; (135) 
nf) = fiW- P (136) 

Because (in reasonable situations) ^^(1 — p) < 0, one has N^' > N^ p \ It is 
interesting to remark that hadronic clans do not coincide with the partonic ones: 



hadronisation creates in general new clans (or breaks partonic ones) [74]]. Indeed 



one also has > ftf 1 . The exact sharing of the multiplicity increase between N 
and n c depends on the actual shape of g^ p \z). 

As an example, in the case of the NB (Pascal) MD we obtain that GLPHD is 
equivalent to the following simple requirement: 

n (h) = pn (p) ) k {h) = k (p) . (137) 
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notice the NB(Pascal) shape is not lost, only one parameter changes when going 
from the partonic to the hadronic level. In this example, both the average num- 
ber of clans and the average number of partons (particles) per clan increase during 
hadronization: 

ftW = k { P ) ln (i + pfl { P ) i k { P )^ ; (1 38) 

fi W = P^- (139) 

It is worth pointing out that GLPHD has some drawbacks: relation (130) resem- 
bles a convolution with a binomial distribution, except that p > 1. On one hand 
this fact prevents a probabilistic interpretation of GLPHD: it is impossible to define 
event by event a probability distribution for obtaining n particles from m partons 
satisfying Eq. (130) . On the other hand it allows to study the distributions that are 
invariant in form under transformation (130) : they are all MD's whose GF depends 
on z and n only via the product n(z—l), i.e., which satisfy the following differential 
equation: 

This equation can be formally solved and gives, at the level of probabilities, Eq. ( [53]) 
—see [H]. 

It should be clear by now that GLPHD is a very strong prescription, certainly 
too strong: there are effects present at hadronic level only (e.g., resonances' decays) 
which affect hadronic correlations and not partonic ones; on the contrary, GLPHD 
fixes all correlations already at parton level (except for their overall strength). 
Nonetheless, GLPHD is very useful as a tool to investigate the predictive power 
of purely perturbative calculations: recall for example the case of H q moments os- 
cillations. 
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4 COLLECTIVE VARIABLES REGULARITIES IN MULTIPARTICLE PRO- 
DUCTION: DATA AND PERSPECTIVES 



The attempt to achieve a unified, QCD-inspired description of multiparticle produc- 
tion in all classes of collisions, in full phase-space and in its limited intervals, both at 
final hadron and parton level and from the soft sectors up to the hard ones, including 
high parton density regions, is the challenge in the field. The main motivation of the 
present section is the conviction — already stated in the introduction — that complex 
structure which we observe might very well be, at the origin, simple, and that such 
initial simplicity manifests itself in terms of regularities of final particle multiplici- 
ties. With this aim, it is instructive and stimulating — in our opinion — both from a 
theoretical and an experimental point of view, to follow the advent of the NB (Pas- 
cal) MD regularity and of KNO scaling violation in multiparticle production (facts 
which haven't yet been fully appreciated in all their implications), then to see the 
sudden failure of the regularity as a consequence of the shoulder structure observed 
in n charged particle MD's P n when plotted vs. n, and finally its reappearance at a 
deeper level of investigation, i.e., in the description of the substructures or classes 
of events of the various collisions. 

What makes even more attractive this development is the finding that, under 
certain reasonable assumptions, the NB (Pascal) MD occurs also (see Section S) at 
final parton level: the generating function of the NB (Pascal) MD is the solution 
of the differential QCD evolution Equations which can be understood as a Markov 
branching process initiated by a parton and controlled in its development by QCD 
elementary probabilities. It should be noticed that the same regularity appears also 
in final n-parton multiplicity distributions in qq and gg systems in Jetset 7.2 Monte 
Carlo calculations based on DGLAP equations. In addition, by using a convenient 
guesswork as hadronization prescription, the NB (Pascal) MD describes within the 
same Monte Carlo generator the hadron level, which turns out not to be independent 
from the parton level but linked to it by strong GLHP duality. All these results can 
be interpreted, as we shall see, in terms of clan structure analysis and suggest that 
the dynamical mechanism responsible of multiparticle production in all classes of 
collisions is independent intermediate gluon sources (the clan ancestors) emission 
followed by QCD parton shower formation. 



4.1 An unsuspected regularity in particle production in cosmic ray physics 

After the discovery of multiparticle production (groups of "mesotrons") in cosmic 
rays in the thirties and the important theoretical work in the forties and early fifties 
in order to explain the highly non linear new phenomenon, the contribution of the 
1966 paper by P.M. MacKeown and A.W. Wolfendale [11] should be mentioned. Its 
interest lies in the attempt to describe all available experimental data on exclusive 
n-particle cross sections, a n (E ), for producing n charged pions generated by a 
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Table 1: NB (Pascal) MD parameters calculated at different primary nucleon energies Eq, using 

Eqs (Bp and fl43) 
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primary nucleon at different energies E (from 25 GeV up to 5 • 10 4 GeV) by using 
a NB (Pascal) MD, i.e. 



-n-k(E ) n—1 



in 



n 



k{E ) 



3=1 



with n = n s — 1, n a being the number of shower tracks. According to the fits 
shown in Figure J6| one can conclude that particles in cosmic ray are not indepen- 
dently produced but are highly correlated. It should be noticed that the Authors of 
[11 ] determined also the energy dependence of the two standard parameters of the 
proposed phenomenological multiplicity distribution. They found 

k-\Eo) = 0.4(1 - exp[-1.8 • 10~ 4 E ]) (142) 

and 

n = 1.8£ 1/4 . (143) 

Particle correlations are controlled by 1/k and become stronger as the primary en- 
ergy becomes larger. To the growth of n from 4.02 at primary energy E = 25 GeV 
to 26.92 at E = 5 • 10 4 it corresponds in the same energy range the decrease of k 
parameter from 557 to 2.5 . This result is even more stimulating when interpreted in 
the framework of clan structure analysis: the average number of clans, N, is grow- 
ing from w 4 to 6.16 and the average number of particles per clan, n c , from rs 1.0 
to 4.37, indicating a strong clustering effect as the primary energy becomes larger 
(Table CD) 
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Figure 16: MD of produced particles from a variety of experiments in cosmic ray vs the number of 
charged tracks n s . The curves represent NB (Pascal) MD's (a) E Q = 30 GeV, (b) E = 25 GeV, (c) 
E = 300 GeV, (d) E = 1.5 • 10 4 GeV, (e) E = 10 4 GeV, where E is the primary nucleon energy 
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4.2 The occurrence of the NB (Pascal) regularity in full phase-space in the accel- 
erator region and the generalised multiperipheral bootstrap model. 

In 1972 the KNO scaling violation and the widening of the n charged particle MD 
with the increase of pi a b are confirmed in hadron-hadron collisions in the accelerator 
region uM- It is found that the NB (Pascal) MD describes n charged particle MD's, 
P n , when plotted vs. n; to the increase with p lab of the average charged multiplicity n 
one notices the decrease of the parameter k, i.e., the dispersion D = a/ (n 2 ) — (n) 2 
becomes larger in contrast with multiperipheral model predictions and in agreement 
with cosmic rays experimental findings. The inverse of k NB parameter, g 2 = k" 1 , 
is interpreted in the generalised multiperipheral bootstrap model [115] as the ratio be- 
tween the pomeron-reggeon-particle vertex controlling diffraction phenomena (g 2 p ) 
and the reggeon-reggeon particle vertex (g 2 M ). g 2 is evaluated to vary between 0.01 
at 30 GeV/c and 0.67 in the above mentioned extreme cosmic ray experiments. 
The occurrence of the NB distribution in the generalised multiperipheral bootstrap 
model is understood as the effect of the weighted superposition of r multiperipheral 
diagrams with n r = rn and n the average multiplicity of the first Poissonian mul- 
tiperipheral diagram, i.e., n = g 2 M hi(s / m a mb) (m a and m b are the masses of the 
initial particles and y/s the cm. energy). The distribution function which weights 
the average multiplicity in the set of multiperipheral diagrams is a function of g 2 
and r and is assumed to be a gamma distribution 

f(r,g 2 ) = l ( (^ 2)/ ^ 2! eM-r/9 2 )- (144) 

It follows 

poo ( =\n ( I n 2)(l-g 2 )/g 2 

o n = a tot / ^y- exp[-rn] 1 9 } exp(-r/g 2 )d(r/g 2 ), (145) 

that is 

„. = ,„ ( " )- (l±^WM (1+ ^. (14 6) 
\1 + g 2 n J n\ 

For g 2 P = 0, n w g 2 M \n(s/m a m b ) and 

On ~ o tot n n — (147) 
n\ 

and in general l/k(s/s ) = g 2 P j 'g 2 M h(s/ 's ). From a purely phenomenological 
point of view, it should be pointed out that multiplicity distributions of available 
hadron-hadron collision experiments in the accelerator region were all successfully 
described by the NB (Pascal) MD (see for example Fig. [17) in a series of papers 
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Figure 17: NB (Pascal) MD (solid line) and Poisson (dashed line) formulae plotted vs. the number 
of charged particles n compared with data on pp collisions at different projectile momenta in the 

laboratory frame |fl4] "|. 
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Figure 18: Overall 1/fc data as a function of projectile momentum in the laboratory frame pi^ for 

the reactions listed UM - 



which were overlooked in the field. In addition, deviations from Poisson MD were 
interpreted as the onset of pomeron-reggeon vertex. Of particular interest was the 
fact that k~ l parameter was increasing with cm. energy in the accelerator region 
as in cosmic rays in all hadron-hadron reactions and crossing the zero point in the 
low energy domain (few GeV/c), as shown in Figure CEHl in agreement with later 
findings []25l, 75[|. 

An alternative interpretation of the regularity in terms of a stochastic cell model 
was also suggested by one of the present Authors in 1973 [13] and then extensively 
discussed by P. Carruthers [76J . k" 1 parameter is understood in this framework as a 
stimulated emission factor, i.e., the fraction of particles already present stimulating 
the emission of an additional particle. 
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4.3 The advent of the regularity in different classes of collisions and in rapidity 
intervals, and its interpretation in terms of clans. 

In 1985, the UA5 collaboration rediscovered NB (Pascal) MD for describing CERN 
pp collider MD data in full phase-space and in symmetric rapidity intervals [77 ] as 
shown in Fig. [19j 

In going from ISR and SERPUKHOV up to CERN pp Collider data 0, 1§(J], n 
becomes larger and k parameter smaller as the cm. energy of the collision becomes 
larger, as shown in Figure [2Q1 confirming the general trends of NB parameters al- 
ready observed in the accelerator and in the cosmic ray regions. In addition, pa- 
rameter k is smaller in central pseudo-rapidity intervals, suggesting the onset of a 
significant dynamical effect in central regions not hidden by conservation laws, and 
indicating stronger particle correlations with respect to the larger intervals where 
conservation laws are the dominant effect (see [8 II ] for a detailed review). 

The next question concerned the possible extension of the regularity to other 
classes of collisions, both in the available energy domain and in symmetric (pseudo)- 
rapidity intervals. The answer came from NA22, EMC and TASSO collaborations 
as shown in Figure [2T1 P n vs. n data were very well fitted by NB (Pascal) MD's in 
hadron-hadron [17], lepton-hadron [ 121 1 ] and lepton-lepton [ 11 8] collisions at nearly 
the same cm. energy both in FPS and in (pseudo)-rapidity intervals. The success of 
the regularity in describing n charged particle MD's in all classes of collisions and 
in symmetric rapidity intervals, i.e., its universality, demanded an interpretation in 
more physical terms. Previous phenomenological models, although leading to the 
experimentally observed n charged particle multiplicity distribution, were too poor 
and in a certain sense too naive for the new wide domain of validity of the regular- 
ity, especially in view also of the almost simultaneous achievements of QCD as the 
theory of strong interactions. This search was developed in two moments. 

Firstly the characterisation of the observed regularity at hadron level in more 
physical terms was studied. Cascading and stimulated emission appeared as the 
natural candidates for the dynamical mechanisms leading to the observed distribu- 
tion. It is interesting to point out that, according to the interpretation of fc -1 , in 
the stimulated emission framework A;^ 1 should be always larger than or equal to 1. 
Since stimulating emission was excluded by experiments (i.e., the fraction of par- 
ticles already present stimulating the emission of a new particle was larger in the 
total sample of positive and negative particles than in the separate samples with only 
positive or negative particles [82]), the attention was concentrated on the cascading 
mechanism. Attention is paid to the possibility of particles already produced to emit 
additional particles and leads to grouping of particles into clusters (which later were 
called clans) [H]. 

The mechanism underlined by the NB (Pascal) regularity goes as follows (recall 
Section [2T2] for the underlying mathematical structure). Clans are by definition in- 
dependently produced, each clan contains at least one particle by assumption and all 
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Figure 20: Compilation of measurements of the average charged multiplicity n and of the nor- 
malised variance D 2 /n 2 in pp and pp collisions. Superimposed are interpolations to the NB (Pascal) 
MD parameters n and k and their sum (the band takes into account interpolation errors) according 
to the NB relation D 2 /n 2 = 1/fc + 1/n il6^. 
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NA22: ji + p collisions 



TASSO: e + e" annihilation 




Figure 21: Multiplicity distributions for final charged particles in various reactions but at similar 
cm. energies; data are taken in central symmetric rapidity intervals \y\ < y c and in full phase-space 
as shown by the labels. The lines are fits with NB (Pascal) MD [79] . 
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Figure 22: Average number of clans, N (left panel), and average number of particles per clan, n c 
(right panel) vs. the half-width of the pseudo-rapidity (for 546 GeV data) or rapidity (for the other 

energies) interval |fl9^. 



correlations are exhausted within each clan. Clan ancestors, after their production, 
generate additional particles via cascading according to a logarithmic multiplicity 
distribution. Two new parameters are the average number of clans, N, and the aver- 
age number of particles per clan, n c . They are linked to the standard NB parameters 
n and k by the following non trivial relations 

N = k\n(l +n/k) and n c = n/N. (148) 

Clan structure analysis reveals new interesting properties when applied to above 
mentioned collisions as shown in Figure[22l In particular it is shown that N(e + e~) is 
much larger than N(pp), whereas n c (e + e~) is much smaller than n c (pp), in addition 
clans in central rapidity intervals are larger than in more peripheral intervals. The 
deep inelastic case is intermediate among the previous two: clans are less numerous 
than that in e + e~ but the average number of particles per clan is much larger. 

Secondly, it was compulsory to estimate final n-parton MD's in a region were 
QCD had poor predictions. The idea was to rely on Monte Carlo calculations. The 
choice of Jetset 7.2 seemed particularly appropriate [83]. Jetset is based indeed on 
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Figure 23: Clan parameters at partonic and hadronic levels in the Jetset 7.2 Monte Carlo in the qq 
and gg systems at different energies. Lines ares drawn to group together points pertaining to the 

same initial state (qq or gg) \2(y\ . 
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DGLAP equations at parton level and has Lund string frag me ntation as hadroniza- 
tion prescription. Extending previous work by W. Kittel [1251], qq and gg systems 
were studied as shown in Figure [23] [26]. Quite satisfactory NB fits for final par- 
ton and charged hadron multiplicities in full phase-space and in symmetric rapidity 
windows were found. In addition, k~ l was decreasing for growing symmetric ra- 
pidity intervals at fixed cm. energy and increasing with cm. energy in fixed rapid- 
ity intervals. Finally N was approximately energy independent in a fixed rapidity 
interval and linearly growing with rapidity variable at fixed cm. energy. These 
results, together with the previous finding that q- and g-jets can be interpreted as 
QCD Markov branching processes, led to the conclusion that partonic clans are 
quite similar to bremsstrahlung gluon jets (BGJ's). At parton level, the claim is that 
clan ancestors are independent intermediate gluon sources. This finding, together 
with the discovery that final partonic and hadronic MD's in Jetset 7.2 were not in- 
dependent but linked by the GLPHD (i.e., n ha dron ~ P^parton and k paiton rs k hadmn , 
see Section |3.3[ ), led to a convincing interpretation of the differences seen in clan 
structure analysis of lepton-lepton and hadron-hadron collisions: in pp collisions, 
the independent intermediate gluon sources become active at very high virtuality 
(a remark which would suggest a larger population per clan and strong colour ex- 
change processes); a situation which is to be contrasted with what is seen in e + e~ 
annihilation, where gluon sources are active quite late at relatively low virtuality, 
thus generating a large number of BGJ with a small amount of particles per clan. 
New results from Tasso |20J ] and ISR [80] on charged particle MD's confirmed the 
just examined properties. 

Interestingly, data on multiplicity distributions in proton-nucleus collisions [84 ] 
revealed new features when studied in terms of clan structure analysis [68]. In par- 
ticular, it was found that the regularity is satisfied at final hadron level in p+Au, 
p+Xe and p+Ar collisions. By applying then GLPHD to the hadronic sector, par- 
tonic level properties in proton-nucleus were determined. It was found that the 
partonic level was much simpler than the hadronic one. The mean number of clan 
(BGJ) is controlled by the central region in rapidity (it is equal within errors for the 
forward and backward hemispheres) and it shows a weak A-dependence. The in- 
crease of the backward hemisphere multiplicity with A is caused by the multiplicity 
increase of the backward BGJ's, which might be due to larger virtualities of their 
initial gluons and/or a more rapid shower development. The relative increase with 
A of the average number of clans in pA with respect to pp collisions is interpreted 
as due to multiple collisions in the target nucleus. Using Poisson statistics for the 
successive emission of the projectile, the mean number of downstream collisions 
suffered by the projectile with a target nucleon can be defined. It has been found 
that 

i/ = [u/{l - e-°)] - 1, (149) 
with v = Aai ne i(pp) / o~i ne i(pA) the mean number of inelastic projectile-nucleus col- 
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Figure 24: Ratio of the average number of partonic clans in p+A collisions to average number of 
partonic clans in pp collisions as a function of the half-width of the rapidity interval. The error bar 
shown at yo = 2 is indicative of the error size at all ?/o' s iH]- 



lisions (they are 2.4, 3.3 and 3.9 for argon, xenon and gold respectively). By mea- 
suring the relative increase of iV parton due to the multiple collisions of the target 
nucleus A, i.e., 

R = iV parton (pA) /N panon (pp) , (150) 

(see Figure [24]), it should be pointed out that its comparison with the mean number 
of downstream collisions suffered by the projectile after its first collision with a 
target nucleon decreases from « 0.27 at y = 0.5 to « 0.17 at y = 3.5. It 
is remarkable that the above mentioned nuclear effect leads to a strongly reduced 
capability of the wounded projectile for further emission of BGJ's. 

In low and intermediate energy nucleus-nucleus collisions, it was also found that 
the NB (Pascal) regularity is obeyed [85[|, especially in small phase-space intervals, 
and in particular that it leads to a very short correlation length [ 186 ] . 

The lesson we learn is that the occurrence of the regularity in all reactions sug- 
gests also here a two-step production process: to the independent intermediate gluon 
sources (the bremsstrahlung gluons), it follows their cascading according to parton 
showers (the BGJ's). In ultrarelativistic nucleus-nucleus collisions, one should ex- 
pect more randomness and higher parton densities and, accordingly, more time for 
parton shower and clan formation. Reduced clan production is therefore, when it 
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occurs, an important signal which should be studied with great attention (see Sec- 
tion EH) 



4.4 The violation of the regularity and its occurrence at a more fundamental level 
of investigation, i.e., in different classes of events contributing to the total 
sample in e + e~ annihilation and in pp collisions 

New experimental facts were around the corner. Collective variable properties in 
e + e~ annihilation and in pp collisions show impressive similarities and quite in- 
triguing differences. Both classes of reactions are characterised by a shoulder struc- 



ture [[27], [281, |29j] visible in the intermediate multiplicity range in n charged particle 
multiplicity distributions P n when plotted vs. n at LEP and at top pp collider en- 
ergies respectively. In addition, the ratio of factorial cumulants K n to factorial 
moments F n , i.e., H n , when plotted as a function of its order n, decreases sharply 
to a negative minimum and follows then a quasi-oscillatory behaviour [87]. In or- 
der to explain both facts in e + e~ annihilation, properties of multi-parton final states 
can be computed in the framework of perturbative QCD by exploiting its branching 
structure and then extended to multi-hadron final states by comparing partonic and 
hadronic distributions under the assumption of LPHD. Results along this line are not 
fully satisfactory. An alternative phenomenological approach consists in thinking 
that the mentioned effects are due to the weighted superposition mechanism of two 
classes of events, the first one with two jets and the second one with three or more 
jets, as identified by a suitable jet finding algorithm [88]. The phenomenological 
guesswork is that both classes are described by a NB (Pascal) MD with characteris- 
tic, different NB parameters. It turns out that in this approach the shoulder structure 
in P n vs. n and H n vs. n oscillations are correctly described [32]. 

In addition, a convincing description of the two effects occurring also in pp 
collisions can be obtained by the same mechanism assuming that the superposi- 
tion occurs between soft (without mini-jets) and semi-hard (with mini-jets) events 
(whose counterpart at parton level are single- and double-parton scattering) and that 
again each class of events is described in terms of a NB (Pascal) MD. The intriguing 
difference between the two collisions appears when one tries to describe the energy 
dependence of FB multiplicity correlation strengths. 



It is found that at LEP [89 ] the two-jet sample of events and the multi-jet sam- 
ple of events, separately considered, do not show FB MC, whereas it has to be 
pointed out that FB MC occurs in the total sample of events. By knowing the aver- 
age charged multiplicities and the dispersion of each class of events and the relative 
weight of two classes, FB MC of the total sample are correctly reproduced within 
experimental errors thanks to a formula based on the weighted superposition mech- 
anism of the two classes of events. This situation should be contrasted with pp 
collisions. Here FB MC are growing in the separate samples of events with and 
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without mini-jets, and of course in the total sample of events. It is shown, in ad- 
dition, that FB MC cannot be explained in this case without introducing clans and 
particle leakage from clans in one hemisphere to the opposite one, i.e., without as- 
suming the form of the MD in each sample of events ^331]. Clans of particles become 
essential in this context. 

In conclusion the regularity is not a property of the full sample of events in 
the examined collisions but a characteristic property of the substructures of each 
collision, i.e., of the separate sample of events contributing to the total sample. In 
the following, the mentioned features of the substructures expected in two classes 
of collisions, as they are guessed today, will be explored with the warning to be jet- 
finding algorithm dependent in e + e~ annihilation and mini -jet definition dependent 
in pp collision. A universally accepted definition of the mentioned substructures is 
indeed per se a fascinating search for future experimental work! 



4.5 Shoulder effect in P n vs. n at top UA5 energy and in e + e~ annihilation and 
its removal by means of the weighted superposition mechanism of different 
classes of events 

The mentioned shoulder structure in P n vs. n is well visible in UA5 data, especially 
at the top energy of 900 GeV, as shown in Fig. [22 by the comparison of data to 
fits with one NB (Pascal) MD. The same data were best fitted [42] by the weighted 
superposition of two NB (Pascal) MD's, as follows: 

P n = aP^im, h) + (1 - a)P n NBD (^2, h). (151) 

This structure corresponds to the superpositions of two classes of events, here called 
generically 1 and 2, the MD of each class being described by a NB (Pascal) MD, 
but with parameters which are different in each class. The fraction of events of class 
1 is given in Eq. Q5D by a. An example of the fit is shown in Fig. [261 

It was founds that the fitted values of a corresponded to the fraction of event 
without mini -jets in analyses performed by the UA1 collaboration at the same en- 
ergies |90[], and the relation between fi\ and n 2 in the fit also coincided with UA1 
findings (n 2 ~ 2ni) [91]. Because the UA5 detector could not identify mini-jets, 
the just mentioned analysis could not be verified completely; further insight is now 
available from Tevatron (Sec. |4.8[) . 

A very similar shoulder structure was found in e + e~ annihilation at LEP: the 



precise data of DELPHI [ 88 ] are shown in Fig. [271 In this case it was possible to 
carry out the analysis by selecting events with a fixed number of jets; this was done 
by DELPHI using the JADE algorithm, and the stability of the fit was verified by 
varying the resolution of the jet finder algorithm. The MD of each class (i.e., 2-, 3- 
and 4-jet events) was successfully fitted by a NB (Pascal) MD; an example is shown 
in Figure [281 
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Figure 27: Multiplicity distributions in different rapidity intervals \y\ < y c and in FPS in e + e~ 
annihilation at the Z° peak shown in KNO variables. The lines show best fits with the NB (Pascal) 
MD, which does not reproduce the shoulder structure [ [2911 . 
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Figure 28: Multiplicity distributions in different rapidity intervals \y\ < y c for 2-jet events (tri- 
angles), 3-jet events (squares) and 4-jet events (circles) in e + e~ annihilation at LEP. Each class is 
normalised to its fraction. Jets were counted using the JADE algorithm with parameter = 0.4. 
The lines show best fits with the NB (Pascal) MD for each subsample js$ \. 



67 



0.0010 

0.0005 

0.0000 

-0.0005 

-0.0010 
0.0010 

0.0005 

0.0000 

-0.0005 

-0.0010 




0.0010 



0.0005 



0.0000 



-0.0005 



-0.0010 



0.0010 



0.0005 



0.0000 



-0.0005 



-0.0010 



- 1 — i — i r~ 



I 1 1 1 ' I ' 

DELPHI 



__L J.. —I. J I I I J L 



10 




15 




Figure 29: The ratio of factorial cumulants over factorial moments, H q , as a function of q; exper- 
imental data (diamonds) are compared to H q moments computed from the weighted superposition 
of two NB (Pascal) MD (solid lines), truncated at the same multiplicity as the data. For each experi- 
ment, different superposition appear corresponding to different values of the resolution parameter of 
jet-finder algorithm. The dashed lines in the SLD plot show predictions of the same parametrisation 
as the solid lines but without taking into account the effect of truncation. In the figure only statistical 

errors are shown [320 . 
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Figure 30: Ratio of factorial cumulant moments over factorial moments, H q , vs. the order q in pp 
collisions at cm. energies 200 GeV, 546 GeV and 900 GeV. The solid line is the prediction of the fit 
with the weighted superposition of two NB(Pascal) MD's truncated at the same multiplicity as the 

MD data j6^\. 
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Figure 31: Ratio of factorial cumulant moments over factorial moments, H q as a function of q; 
e + e~ experimental data (circles) from the SLD Collaboration at at y/s = 91 GeV are compared 
with the predictions of several parameterisations, with parameters fitted to the data on MD's: a full 
NBD (dotted line); a truncated NBD (dot-dashed line); sum of two full NBD's (dashed line); sum of 

two truncated NBD's (solid line) 



4.6 H q vs q oscillations and the weighted superposition mechanism 

After the prediction of sign oscillations of H q moments of the MD, when plotted 
against the order q, at parton level in analytical QCD calculations (See Sec. 13.2.11 ), 
such oscillations where looked for and found also in data [[87!], again in all classes of 
collisions (see, e.g., Figs. [29] and [30]). However, QCD predictions are only qualita- 
tively in agreement with e + e~ data, but quantitatively rather far, in particular in the 
computation of the magnitude of H q moments. Indeed, the fact that experimental 
data are truncated — due to the finiteness of the data sample — plays an important 
role in the magnitude of the oscillations [6l|]. 

It was shown previously (Sec. 12.4-] ) that one NB (Pascal) MD also shows such 
sign oscillations when truncated — a complete NB does not, see Eq. (33) . That a 
single truncated NB can reproduce pp collisions data is shown by the solid lines in 
Fig. SOI notice that this happens even if the fits to the MD are not satisfactory (recall 
Fig.gS) 

The same is not true in the case of e + e~ annihilation. An accurate analysis was 
performed on SLD data at 91 GeV cm. energy, which are badly fitted by one NB 
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but well fitted by the weighted superposition of two such distributions. The H q 
moments computed by SLD are shown in Fig. [3T[ The single NB (Pascal) MD, 
even when truncated, cannot describe the H q moments (dot-dashed line in Fig. |3T1 ) 
The weighted superposition of two complete NB (Pascal) MD's comes closer, but 
is not enough (dashed line in Fig. [311 ) It is only after truncating, as in the data, the 
weighted superposition of two NB (Pascal) MD's that a successful description can 
be achieved (solid line in Fig. |3Ti ) 



4. 7 Towards the TeV energy domain in pp collisions. 

The main scope of this Section is to explore possible scenarios for multiparticle 
production in pp collisions in the TeV energy domain following our knowledge of 
the GeV energy region. 

The main conclusion of the phenomenological study on multiparticle production 
in pp collisions in the GeV region is that there are two classes of events, the class 
of soft (without mini-jets) events and the class of semi-hard (with mini-jets) events, 
whose n charged particle multiplicity distributions can be described in terms of NB 
(Pascal) MD's with characteristic and ki (i = soft, semi-hard) parameters, i.e., the 
NB (Pascal) MD regularity is still applicable but at a deeper level of investigation 
than initially thought. The weighted superposition of the two components, each de- 
scribed by a NB (Pascal) MD, explains indeed at least three important experimental 
facts of multiparticle production in pp collisions, i.e., 

a. the shoulder effect in P n vs. n; 

b. the oscillations in H q vs. q; 

c. forward-backward multiplicity correlations (these will be described in detail 
in Sec. WM . 

The success of the phenomenological analysis in terms of NB (Pascal) MD's of 
experimental data or available fits on collective variable properties, together with 
the QCD roots of the MD's itself, suggest to consider these results as a sound basis 
for the description of possible multiparticle production scenarios in the TeV energy 
domain accessible to future experiments at CERN LHC. The approach we decided 
to follow depends in a crucial way on the NB (Pascal) MD parameters behaviour 
and the problem one would like to solve first is how to extrapolate these parameters 
(fii and ki (i = soft, semi-hard)) from the GeV to the TeV energy region [[93, |94Q . 

4.7.1 The average charged multiplicities. 

Extrapolations from UA5 collaboration results on soft events suggest 

rwt(v^) = -5-54 + 4.72 In yfi, (152) 
and estimates for semi-hard events from UA1 collaboration [91] give 

^semi-hard ( ~ 2n soft (v/s) (case A) (153) 
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Figure 32: Energy dependence of the superposition parameter a (fraction of soft events) in the two 
cases of a linear (solid line, Eq. (1531 )) and quadratic (dashed line, Eq. (154) ) dependence of the 
average multiplicity of the semi-hard component on cm. energy. The triangles are the result of the 

UA5 analysis Q. 



80 



60 



40 



20 







1 1 1 1 II II 1 


1 1 


1 1 1 1 1 1 | 1 1 1 1 1 1 1 1 | 






. • s 
. f 
s s _ 






S ' s ** 

y ** ■ — 

.3' 

S <•* ■ — 












A ' ^ 








1 


1 1 


1 nl 



10 100 1000 10000 

cm. energy (GeV) 



Figure 33: Average multiplicity n vs. cm. energy. The figures shows experimental data (filled 
triangles) from ISR and SPS colliders, the UA5 analysis with two NB (Pascal) MD's of SPS data 
(circles: soft component; squares: semi-hard component), together with our extrapolations (lines: 
dotted: total distribution; dashed: soft component; short-dashed: semi-hard component, Eq. (153) ; 
dot-dashed: semi-hard component, Eq. (154) |93|] . 
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or 

^semi-hard 

(x/i) = 2n soft (v^) + 0.1 In 2 ^ (case B). (154) 

Since the average charged multiplicity is well parametrised by n tota i(-v/s) = 3.01 — 
0.474 In s/s + 0.75 In 2 ^fs and the total charged average multiplicity n total in terms 
of the superposition of two components with weight a turns out to be 

rkotal(\/s) = afl ™f\. + (1 ~~ «)^semi-hard, (155) 

it follows that 

a = 2 - n t otaiMoft (156) 

in the case (A) and 

a = 1 + (n soft - ritotai)/ ("-soft - 0.1 In 2 y/s) (157) 

in the case (B). We expect therefore for cases (A) and (B) in the TeV region the 
general trends of a and the average charged multiplicities shown in Figures [32] and 

S3 



4.7.2 The k and ki (z=soft, semi-hard) parameters. 

For the soft component it is not too daring to assume that KNO scaling is not vio- 
lated in the new energy domain, and accordingly to decide that /J 2 oft /fi 2 so f t remains 
approximately constant 0.143) with fc soft above 200 GeV cm. energy « 7. As- 
sumptions for the semi-hard component are more delicate and at least three possi- 
bilities which could characterise different scenarios should be examined. 

In scenario 1, being /c sem i-hard in Fuglesang's fit [42] approximately 13 at 900 
GeV and n sem i-hard even larger than ri soft it is assumed that A; semi . hard remains approxi- 
mately constant with -D 2 emi . hard /n 2 semi . hard ps 0.09, i.e., in this scenario KNO scaling 
is not violated in the two separate components contributing to the total sample of 
events. 

In scenario 2, a strong KNO violation is assumed with -D s 2 emi . hard /r?. 2 emi _ hard grow- 
ing logarithmically with cm. energy above 200 GeV and /c se mi-hard falling from 79 
at 200 GeV to « 3 at 14 TeV (fc semi . hard « 1/ In y/s). 

Scenario 3 is the QCD-inspired one, i.e., 

^semi-hard = « + V^ong (158) 

with a strong « (InQ/Qs)' 1 , Q and Q s are respectively the initial virtuality and the 
cut-off of the parton shower. Q s , a and b are determined by a least square fit to the 
values of /c sem i-hard given by UA5 collaboration at 200 GeV, 546 GeV and 900 GeV. 
It follows 

I o.42 

^mi-hard = 0-38 - ^ ^^i/lO) (159) 
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Figure 34: Clan parameters N (panels in the left columns) and n c (panels in the right column) are 
plotted for the scenarios described in the text (from top to bottom: first row: scenario 1; second 
row: scenario 2; third row: scenario 3). The figures shows experimental data (filled triangles) from 
ISR and SPS colliders, the UA5 analysis with two NB(Pascal) MD's of SPS data (circles: soft 
component; squares: semi-hard component), together with our extrapolations (lines: dotted: total 
distribution; dashed: soft component; short-dashed: semi-hard component) |)93|. 
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The decrease of fc sem i-hard i s milder than in scenario 2, it goes from 13 at 200 GeV to 
7 at 14 TeV; interestingly, the QCD inspired behaviour of fc sem i-hard is intermediate 
among the previous two, with no KNO scaling and strong KNO scaling violation 
respectively, which describe probably quite extreme situations. 



4.7.3 Clan structure analysis of pp collisions in the TeV region 

Following the above general discussion on the possible behaviours of NB (Pascal) 
MD's parameters in the TeV region, their interpretation in terms of the average 
number of clans, N, and of the average number of particles per clan, n c , reveals 
unsuspected new features which deserve particular attention (See Figured]). 

Of course, in scenario 1 , iV so f t and iV sem i-hard are expected to continue to increase 
— although fc SO ft and fc S emi-hard remain constant — in view of the increase of the av- 
erage charged multiplicity in each component. Clans are quite numerous but their 
size (i.e., n C!SO ft and n CiSem i-h ar d) although still growing with cm. energy remains quite 
small. Notice that the behaviour of clans of the soft component is the same by as- 
sumption in all scenarios. The situation is similar to what we know from the GeV 
region. What makes this analysis striking is the remark that iV sem i_hard in the 2nd and 
3rd scenarios becomes smaller with the increase of the cm. energy, and the average 
number of particles per clan very large. The phenomenon is enhanced in particu- 
lar in scenario 2 where -D s 2 emi . hard /n 2 sem i-hard shows a dramatic increase with energy 
as requested by strong KNO scaling violation. The reduction of iV sem i-hani and the 
simultaneous quick increase of n sem i-hard suggests that in the scenarios 2 and 3 the 
available cm. energy goes more in particle production within a clan rather than in 
new clan production, differently from what was found in scenario 1 . 



Data on MD's at 1.8 TeV cm. energy (from E735 experiment [1951]), when com- 
pared with our predictions are closer to scenario 2 but with an even wider MD. It 
is to be stressed that E735 results on full phase-space multiplicity distributions do 
not completely agree with those obtained at comparable energies at the SppS col- 
lider by the UA5 Collaboration [|27j, Um, see Fig. [351 Tevatron data are more precise 
than SppS data at larger multiplicities (they have larger statistics and extend to larger 
multiplicities than UA5 data), but much less precise at low multiplicity. Both sets of 
data show a shoulder structure, but the Tevatron MD is somewhat wider. It should 
be noticed that E735 data are measured only in \rj\ < 3.25 and p T > 0.2 GeV/c 
then extended to full phase-space via a Monte Carlo program. Notwithstanding the 
mentioned discrepancy, there is consensus in accepting the presence of (at least) 
two substructures in the data, either related to the varying impact parameter [ 196 ] or 
to multiple parton scattering [[971, 98] . In terms of the scenarios discussed in this 
Section, one can reasonably argue that scenario 1 is the less probable to occur and 
that therefore the above mentioned finding on clan reduction in scenarios 2 and 3 
raises some interesting questions on the properties of clans themselves, i.e., how far 
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Figure 35: E735 results on charged particle multiplicity distributions in pp collisions at various 
energies in full phase-space compared with UA5 results at similar energies. Data from the two 
experiments which were taken at nearly the same energy are rescaled by the same factor [19511 . 



the number of clans of the semi-hard component could be reduced in our frame- 
work? is the reduction to one or very few clans compatible with the structure of the 
semi-hard component? should we expect, with the increase of the cm. energy of 
the collision, the onset of a new component to be added to the previous two? The 
attempt to answer these questions is postponed to the next sections, where also we 
would like to address the related question on the real nature of clans, which seems 
quite natural at this stage of our search, i.e., are clans observable (massive) objects? 

4.7.4 Analysis in pseudo-rapidity intervals 

We would like to point out that our study on the possible three scenarios in the TeV 
region based on extrapolations of the experimental knowledge of the GeV energy 
domain can be extended from full phase-space to rapidity intervals. 

Since only after the classification of events (soft and semi-hard) has been carried 
out do we look at phase-space intervals, it seems quite reasonable to assume that the 
weight factor a depends on the cm. energy (as in full phase-space) and not on the 
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Table 2: Values of 1 /fc tota i, 1 /fc so ft and 1 / fc sem i-hard m our extrapolations for different rapidity intervals 
and for full phase-space (FPS). l/fc so ft and, in scenario 1, l/fc sem i-hard, are energy independent and 
their value is given in the table; in the other cases we give the parameters and the form of the energy 

dependence |94j| . 



interval 


soft comp. 


scenario 1 


scenario 2 


scenario 3 


Irjl <C ?1„ 
I'/l — '/c 


a sott 


k~ l 

semi-hard 


k^,(n„ i/s) = 

""totalv/C) V ° J 

a + b In ^fs 


"'semi-hard V'/C) V *V 

C + D/y/hx{^/10) 


Vc = 1 


0.294 


0.217 


a = 0.02 
b = 0.08 


C = 0.97 
£> = -1.6 


Vc = 2 


0.286 


0.172 


a = -0.06 
b = 0.08 


C = 0.88 
/J = -1.5 


Vc = 3 


0.250 


0.156 


a = -0.12 
6 = 0.08 


C = 0.72 
D = -1.2 


FPS 


0.143 


0.077 


a = -0.082 
b = 0.0512 


C = 0.38 
D = -0.65 



pseudo-rapidity interval r\ c . 

The ?7 C dependence comes therefore from fti and ki parameters only. 

In order to be consistent with our assumptions in full phase-space, the single 
particle density must show an energy independent plateau which extends in pseudo- 
rapidity for some units around 77 = in each direction. Accordingly, the height of 
the plateau (n ) of the soft and semi-hard component is fixed equal to 

n ,soft ~ 2.45 (160) 



(a value compatible with low energy data [ 180 ] where only the soft component is 
present) and 

™0,semi-hard ~ 6.4 (161) 

(notice that the assumption of an energy independent plateau for the semi-hard 
component is not compelling and that the following increase with cm. energy 
of ^o,semi-hard is again compatible with all our previous discussion: no,semi-hard ~ 
6.3 + 0.07 In v/i). 
Finally 

ni(r) c ) = 2n 0ji r] c (i = soft, semi-hard) (162) 

and 

«total(?7c, Vs) = a S oft(\/s)^soft(?7c) + [1 - "soft (v^)] ^semi-hard (Vc)- (163) 
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Figure 36: The average number of clans, N, (top two rows) and the average number of particles per 
clan, n c , (bottom two rows) are plotted against the cm. energy for three rapidity intervals (dotted 
line: rj c = 1; dash-dotted line: r\ c = 2; dashed line: r\ c = 3) and for f.p.s. (solid line), for each 
scenario (in columns, from left to right: scenario 1, 2 and 3) and for each component (in rows, from 

top to bottom: soft and semi-hard) ^4\\. 
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By recalling that 



ijUfac V*) ( 1 + 7 r — 7=\ ) = "(v^Koft^ Vs) ( 1 + 1 — T - " — 7=7 J 

V KtDtal(^c, V s )/ V KoniVc V s ) J 

+ (1 - a(v^))^se ml -hard(^, V^) U + T ^ 7=7 J (164) 

\ K semi-hardl 1 '/c; V s ) J 

results contained in table [2] are obtained. Clan structure analysis confirms the gen- 
eral trends found in full phase-space, favouring a lower number of clans of smaller 
size in more restricted rapidity intervals; of particular interest is again the semi-hard 
component. The results are illustrated in Fig. [361 In scenario 1, as the energy in- 
creases one notices a copious production of clans of nearly equal size in all rapidity 
intervals. In scenario 2, in all pseudo-rapidity intervals, to the higher aggregation 
of newly created particles into existing clans it follows the aggregation of clans 
into super-clans favouring stronger long range correlations. Scenario 3 (the QCD- 
inspired one) leads to predictions which are — as usual — intermediate between the 
previous two. 

4.8 Hints from CDF 

At the Tevatron, the subsample of events with no energy clusters above 1 . 1 GeV and 
the subsample of remaining events were separated in the total sample of pp events 
collected in the pseudo-rapidity window \rj\ < 1 with CDF minimum bias trigger at 
s/s = 630 GeV and at s/s = 1800 GeV & 

Being the two samples of events highly enriched in soft and hard interactions 
respectively and the Collaboration quite aware of the difficulty of a correct identifi- 
cation and separation of jets with transverse energy lower than 5 GeV, events with 
no clusters were called 'soft' and those with at least one cluster, 'hard', n charged 
particle multiplicity distributions for the minimum bias samples at the two energies 
were plotted in the KNO form and showed, in the interval \q\ < 1, KNO scaling vi- 
olations. When the two samples of events are separately plotted the soft component 
satisfies KNO scaling, whereas the hard component clearly violates KNO scaling, 
suggesting that the behaviour of the second component is the main cause of KNO 
scaling violations in the full sample, as shown in Fig. 571 

Interestingly the mean transverse momentum (p T ) when plotted versus n charged 
multiplicity for the full minimum bias sample is larger at 1 800 GeV than at 630 GeV, 
whereas it does not grow with the cm. energy in the soft sample (See Fig. [381) The 
different behaviour at the two energies in the full minimum bias sample is there- 
fore entirely due to the hard sample and it is probably the effect of high transverse 
energy interactions (mini -jets in the literature). 

These results, although limited to a single pseudo-rapidity interval only, provide 
an interesting experimental support to the two main assumptions which justified our 
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Figure 37: Multiplicity distributions for the full minimum bias, the 'soft' and the 'hard' samples at 
1800 and 630 GeV from CDF; data are plotted in KNO variables. In the bottom panel of each figure 
the ratio of the two above distributions is shown. The two continuous lines delimit the band of all 

systematic uncertainties [990. 
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Figure 38: Mean transverse momentum vs multiplicity for the full minimum bias, the 'soft' and the 
'hard' samples at 1800 and 630 GeV from CDF. On the bottom panel of each figure the ratio of the 

two curves is shown [ jggj l . 
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approach to the TeV energy domain in pp collisions, i.e.: 

a. that there are at least two components in the total sample of events at thresh- 
old of the TeV energy domain (they are called 'soft' and 'semi-hard' in Ref. [[420 
and by us (Sections [43J and 14.71 ), and 'soft' and 'hard' in the paper by the CDF 
Collaboration); 

b. that the soft component satisfies KNO scaling. 

In addition, as far as the second component (the semi-hard one) is concerned, 
the scenario with strong KNO scaling violation or the one based on QCD inspired 
behaviour seem favoured with respect to the scenario with KNO scaling. 



4.9 Forward-backward multiplicity correlations. The demand for the existence of 
clans. 

The experimental problem consists in studying, in each event, the number of parti- 
cles falling in the forward hemisphere (F) and in the backward hemisphere (B) [33]. 
Usually, the average number of particles in the B hemisphere, n B , is successfully 
parametrised in terms of the number of particles in the F hemisphere, n F , according 
to the following linear relation: 

n B (n F ) = a + bpB n F, (165) 

where &fb is the FB multiplicity correlation strength: 

Cov[n F ,n B ] 
FB " (Var[n F ]VarM)V2 

_ {{n F - n F )(n B - n B )) (166) 

" [((n F -n F y)((n B -n B y)} 1/2 ' 

The existence of strong correlations between particles in the two hemispheres is 
an important effect and could be a signal of large colour exchange among partons 
at parton level. It is instructive with this aim to examine the experimental situation 
on FB multiplicity correlations in pp collisions and e + e _ annihilation. 

In pp collisions the F hemisphere coincides with the region of the outgoing 
proton, the B hemisphere is the symmetric region in the opposite direction. In pp 
collisions, 6 FB is growing with cm. energy and is rather large. At 63 GeV (ISR) 
6 FB = 0.156 ± 0.013 and at 546 GeV (UA5) b FB = 0.43 ± 0.01 in 1 < \r}\ < 4 and 
&fb = 0.58 ± 0.01 in < \r)\ < 4. Its general trend with cm. energy is given by 



^FB 



-0.019 + 0.061 Ins (167) 



In e + e~ annihilation the F hemisphere is chosen randomly between the two hemi- 
spheres defined by the plane perpendicular to the thrust axis. OPAL collabora- 
tion has found 6 FB = 0.103 ± 0.007 for the total sample of events and 6 FB ~ 



82 



in the separate two- and three-jets sample of events. TASSO Collaboration finds 
bpB = 0.080 ± 0.016 in the total sample of events, we do not have estimates in this 
case of b FB in the separate samples of events. In conclusion in e + e _ annihilation 6 FB 
grows with energy but is rather small. 

Accordingly our interest on the theory side was limited to symmetric reactions 



only [133I], although a generalisation to asymmetric collisions can be worked out 

horn. 

The number of F and B particles, n F and n B , are of course random variables 
with n F + n B = n and 

P n = P totai(n F ,n B ). (168) 

n=np+ns 

ftotai^F, tib) is the joint distribution for the weighted superposition of different 
classes of events and is equal to aPi(n F , n B ) + (1 — a)P 2 (n F , n B ); a is the weight 
of class 1 of events with respect to the total sample of events. The calculation of 6 FB 
according to Eq. (11661 ) leads then to the following result: 

ahDlJjl + 6i) + (1 - a)b 2 DlJ(l + b 2 ) + \a(l - a)(n 2 - n,) 2 
' FB ~ aD 2 n J {I + 6i) + (1 - a)D 2 n 2 /(l + b 2 ) + \a{l - a){n 2 - m) 2 

(169) 

Here b\ and b 2 are the correlation strengths of events of class 1 and 2, D\ and D 2 the 
corresponding dispersions and fix and n 2 the corresponding average multiplicities. 

For b\ =b 2 = 0, one has 6 FB — > 612 and the previous formula is reduced to the 
following one 



ha(l — a)(n 2 — ni) 2 
— - L — = = — . (170) 

aD 2 nX + (1 - a)Dl 2 + \a{l - a){n 2 - n x ) 2 



It is a general property of the two formulae to be independent of the specific 
form of the rii particles and n 2 particles multiplicity distributions, P nx and P n2 , and 
to depend on a, fix, n 2 , D x and D 2 parameters only. 

OPAL collaboration measured FB multiplicity correlation strengths in e + e~ 
annihilation in the two separate samples of events (2-jet sample and 3-jet sam- 



ple) and in the total sample [189] . It has been found that b\ ~ b 2 w and 
b FB = b 12 = 0.103 ± 0.007 respectively. Being a = 0.463, n x = 18.4, D 2 = 25.6, 
n 2 = 24.0 and D\ = 44.6, the theoretical estimate of b 12 according to formula (1701 ) 
turns out to be 0.101 in perfect agreement with the experimental value 0.1 03 ±0.007. 
The success of our formula (1701 ) in describing FB multiplicity correlations in e + e~ 
annihilation should be contrasted with its failure in describing 6 F b hi pp collisions: 
b\ and b 2 are expected to be different from zero in the latter case and the more 
general formula (1691 ) is needed. The parameters of such general formula can be 
computed under the assumptions that : 
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a) particles are independently produced; 

b) they are binomially distributed in the F and B hemispheres; 

c) the overall MD for each component (soft and semi-hard) is a NB (Pascal) MD 
with characteristic parameters n.j and fcj (i — 1, 2). 

It can then be concluded that charged particles FB multiplicity correlations are not 
compatible with independent particles emission and that charged particles FB mul- 
tiplicity correlations are compatible with the production of particles in clusters (i.e., 
clans in view of assumption c). It follows that: 

a. the joint probability distribution P(np,nB) is written as the convolution 
over the number of produced clans and over the partitions of forward and backward 
produced particles among clans; 

b. the symmetry argument should be used; 

c. the introduction of a leakage parameter p is needed: the new parameter p con- 
trols the probability that a binomially distributed particle (and generated by one clan 
lying in one hemisphere) has to leak in the opposite hemisphere (p = 1 means no 
leakage, the variation domain ofpisl/2<p<l and p < 0.5 implies that the clan 
to which the emitted particle is belonging is classified in the wrong hemisphere); 

d. a covariance parameter, 7, between F and B particles within a clan is also 
introduced; 

e. within this framework clans are binomially produced in the F and B hemi- 
spheres with the same probabilities and particles belonging to a clan are indepen- 
dently produced in the two hemispheres. 

From these assumptions one gets that: 

i. clan structure analysis is applicable for each component, 

ii. clans in each component are independently emitted (their distribution is Pois- 
sonian), 

Hi. clans are binomially distributed in the two hemispheres, 
iv. logarithmically produced particles from each clan are also binomially dis- 
tributed in the F and B hemispheres but with different probabilities (the correspond- 
ing leakage parameters are different). 

The parameters of formula (11691 ) can then be calculated. We find indeed for the 
single component strength 



Formula (|171| ) is valid for the class of CPMD's. Assuming in addition, according 
to iv, that clans in each component are of NB (Pascal) MD type with ki and 




D% + A(d 2 Np (N))(p - q) 2 - AN^/n 2 + 2NDlJnl 
D 2 /n - D 2 /n c - A(d 2 NF (N))(p - q) 2 n c /N + 4 7 /n c 
D 2 Jn + D 2 /n c + 4(d 2 (N)){p - q) 2 n c /N - 4 7 /n c ' 



(171) 
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Figure 39: Predictions for the correlation coefficients for each component (soft and semi-hard) and 
for the total distribution in pp collisions. For each scenario, three cases are illustrated, correspond- 
ing to the three numbered branches: leakage increasing with y/s (upper branch, ©), constant leakage 
(middle branch, ©) and leakage decreasing with (lower branch, ©). Leakage for the soft compo- 
nent is assumed constant at all energies. The dotted line is a fit — see Eq. (1671 ) — to experimental 

values Q. 

parameters, one has 

4b = 2niPi(l 

with Pi = fiij [fii + ki). It should be remembered that the leakage parameter pi is the 
fraction of particles within one clan which fall in the same hemisphere where the 
clan was produced. Assuming that the semi-hard component is negligible at s/s = 
63 GeV and knowing 6fb(a/s = 63 GeV) = 0.156 ± 0.013 from experiments, and 
of course n so f t , k so f t leakage parameter for the soft component can be determined. 
We find p sott = 0.78, i.e., 22% of the particles are expected to leak from clans in 
one hemisphere to the opposite one. 

The relatively small increase of the average number of particles per clan from 63 
GeV to 900 GeV for the soft component (n c goes from « 2 to « 2.44) suggests to 
consider p soft constant throughout the GeV energy region. Accordingly, being n so f t 
and A; sott known at 546 GeV, &FB,soft for the soft component can be determined at 
such energy; its value, inserted in Eq. (11691 ), allows the determination of &FB,semi-hard 
in view of the fact that 6 F B,totai = 0.58. It is found that Psemi-hardCv^ = 546 GeV) = 
0.77. The relatively small increase of the average number of particles per clan also 
for the semi-hard component, from 200 GeV up to 900 GeV (n c goes from 1.64 to 
2.63), suggests to take also p sem i-hard constant in the GeV region. 



Pi)/(rii + ki- 2riiPi(l - pi)) 
Pi)/(1- 2/3 iPi (l -pi)), 



(172) 
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Figure 40: Results of the weighted superposition model for ns{np) vs. np compared to experi- 
mental data in full phase-space at 63 GeV (a) and in the pseudo-rapidity interval \i]\ < 4 at 900 GeV 

(b)Q. 



Under the just mentioned assumptions: 

a) the correlation strength cm. energy dependence is correctly reproduced in the 
GeV energy range from ISR up UA5 top cm. energy and follows the phenomeno- 
logical formula 6 FB = -0.019 + 0.061 Ins (Figure S3). 

b) n B {n F ) vs n F behaviour at 63 GeV cm. energy (ISR data) is quite well 
reproduced in terms of the soft component (a single NB (Pascal) MD) only and at 
900 GeV cm. energy (UA5 data) in terms of the weighted superposition of soft and 
semi-hard components, i.e., of the superposition of two NB (Pascal) MD's. (Figure 

a®. 

Concerning the general trends of &fb vs -/s and of /^(np) vs n F in the TeV 
energy domain, the situation is not unanimous. In some approaches one expects a 
continuous increase towards the value 1 : either reasoning on the use of a soft and 
a hard component within the eikonal model for mini -jets production [101] o r (in a 
purely statistical analysis) on the independent production of pairs of particles [ 102] , 
rather than of individual particles or clusters/clans. The three scenarios extrapolated 
from our knowledge on pp collisions in the GeV region and discussed in Section \4J\ 
provide different predictions. Being for instance in all scenarios at 14 TeV n CiSoft ps 
2.98 (n C)SO ft is ~ 2 at 63 GeV and « 2.63 at 900 GeV) to assume p so f t constant 
(~ 0.78) also in the new energy domain seems quite reasonable. The situation for 
Psemi-hard is different. With the exception of the first scenario for which the validity 
of KNO scaling is assumed also for the semi-hard component and n CiSe mi-hard goes 
from « 2.63 at 900 GeV to ~ 3.28 at 14 TeV after a very weak increase in the 
GeV energy region, p sem i-hard constant is an unrealistic assumption; n CiSe mi-hard in the 
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scenario 1 scenario 2 scenario 3 




10 10 2 10 3 10 4 10 10 2 10 3 10 4 10 10 2 10 3 10 4 

cm. energy (GeV) 



Figure 41: Energy dependence of the correlation coefficients for each component (soft and semi- 
hard) and for the total distribution in pp collisions. The dotted line is a fit — see Eq. (11671 ) — to 

experimental values f| 1 03| l - 

second scenario where strong KNO scaling violation is allowed is indeed « 2.63 at 
900 GeV but « 7.96 at 14 TeV and in the third scenario (the QCD inspired one) the 
^c,semi-hard values at the two energies are ps 2.5 and ps 5 at the two energies. 

What should be noticed is that in all above mentioned scenarios &fb is bending 
in the new region and that 6 F b bending is more pronounced if p S emi-hard increases 
(less particle leakage from clans) and it is less pronounced when j> sem i-hard decreases 
(more particle leakage from clans). 6pB,totai bending can be therefore reduced by 
allowing more particle leakage from individual clans, i.e., by allowing a larger par- 
ticle flow from one hemisphere to the opposite one. The real question is how far 
in energy the empirical fit on &fb, total c.m. energy dependence will continue to find 
experimental support without spoiling the main assumptions of our approach to 
FBMD correlations. 

It is indeed instructive to reexamine the three scenarios extrapolated in the TeV 
energy domain in order to include in our scheme the 6 FB , total points at 1000 GeV and 
1800 GeV measured by E735 Collaboration at Fermilab to sit on the same straight 
line in Figure \4\\ of the other points at lower c.m. energies. It is remarkable that 
the inclusion of the new points demands that p sem i-hard must decrease and accord- 
ingly particle leakage from one hemisphere to the opposite one must increase in all 
scenarios. 

Satisfactory results for &FB )to tai ener gy dependence are obtained in all three sce- 
narios in Figure @J]by taking p soft w 0.8 and p semi _ hard ~ 0.84 — 0.07 ln(- v /s/200) for 
y/s > 200 GeV. 
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It should be pointed out that in the scenario characterised by a semi-hard com- 
ponent with strong KNO scaling violation (scenario 2) FBMC strength has a less 
steep behaviour with the increase of the cm. energy of the collision and its satura- 
tion towards 1 — as that of &FB jto tai — is quicker than in the other scenarios. It turns 
out that the linear behaviour of &FB,totai with the increase of cm. energy shown in 
Figure 0T] is incompatible with our approach to FBMD correlations above 2.5 TeV 
in scenario 1, above 3.5 TeV in scenario 2 and above 5 TeV in scenario 3. By in- 
compatible we mean that leakage parameter p sem i-hard cannot be adjusted in order to 
reproduce the eventual linear behaviour of &FB,totai above the mentioned cm. energy 
extreme values in the different scenarios without spoiling the model itself. 

On the contrary the experimental finding of the mentioned linear fit of &fb, total 
at higher cm. energy, say at LHC, would demand in our approach the existence of 
a third class of events to be added to the soft and semi-hard ones with even larger 
leakage of particles from clans in one hemisphere to the opposite one than that given 
by the a logarithmic decrease with cm. energy of p sem i-hard- 

It seems that in the new class clan production is therefore disfavoured with re- 
spect to the production of more particles within clans. The main characteristic of 
such events is to be composed of few high particles density clans generated by nec- 
essarily high virtuality ancestors. This fact if confirmed would have an interesting 
counterpart at parton level and allow a suggestive interpretation of the completely 
different behaviour of the FB multiplicity correlations in pp collisions and e + e~ 
annihilation. 

As already seen, in e + e~ annihilation, FB multiplicity correlations are almost 
inexistent in the two component and very weak in the total sample of events (an 
effect entirely due to the superposition of the two classes of events). Here clans are 
numerous but quite small in size, i.e., the independent intermediate gluon sources 
(the BGJ's) — the clan ancestors at parton level — are expected to be generated quite 
late in the production process at relatively low virtualities, implying small colour 
exchange among partons and then very weak FB multiplicity correlations among 
hadrons. 

In pp collisions clans are less numerous than in e + e~ annihilation but their size 
is larger, i.e., more particles are contained within each clan. At parton level one 
should expect that BGJ's are generated quite early in the production process at 
relatively high virtualities with enough room for many parton splittings and con- 
sequently stronger colour exchanges. All that at hadron level will lead to quite 
remarkable FBMC's. In this framework the eventual third class of events whose ex- 
istence would be determined by the observation of a linear increase of &FB,totai with 
cm. energy in the proper TeV energy domain will have some peculiar character- 
istics, which make its finding particularly appealing. First of all one should see in 
these events a reduction of the average number of clans to few units in order to guar- 
antee a high particle population per clan and a quite consistent particle leakage from 
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n 



Figure 42: (z 2 ) n vs n at 900 GeV in the interval 1 < \q\ < 4. Data points are from UA5 Collab- 
oration, the solid line is the result of our model in < \r)\ < 4, the dash-dotted line is a linear fit 

fioi. 

one hemisphere to the opposite one. At parton level the fewer BGJ's production of 
the new class of events with respect to the semi-hard one should be originated by a 
very high virtuality process — much higher than for semi-hard events — leading to 
high parton density regions characterised by huge colour exchanges. 

Another variable sometime used in the literature in addition to 6 FB should be 
mentioned: it is the average of the forward backward particle multiplicity difference 
over all events at fixed multiplicity n, i.e., (z 2 ) n = (n F — n B ) n . The new variable 
works at a deeper level of investigation than 6 FB (the latter is related to the average of 
(n F —riB)n overall multiplicities) and it is of particular interest for global properties 
of the collisions related to average n. In our framework one has 

, 2 v 4c C,softH aP soft(n) | 4^ F Semi , hard (n)(l - a)P semi . hard (n) 

"total [n) -Ptotal(n) 
With PtotalO) = aP soft (n) + (1 - a) P semi . hard (n) . 

In Figure 321 data points from UA5 collaboration in 1 < \rj\ < 4 at 900 GeV on 
(z 2 ) n vs. n are compared with our model results in full phase-space (solid line) as 



well as with the behaviour predicted in the model of Chou and Yang [ 104] (dashed 



line). The latter model supposes a composition of 'stochastic' (i.e., Poissonian) 
fluctuations in z with 'non-stochastic' ones (in n, obeying KNO scaling), compo- 
sition which is regulated by the collision geometry; in this scheme one obtains a 
linear dependence of (z 2 ) n on n. Below n pa 40, no differences between the two 
models are visible and both models are in agreement with experiments. Above 
n PS 40, our model shows a clear "hump" structure. The situation in such range 
is not experimentally clear and consequently no conclusions can be drawn on this 
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point. Predictions on (z 2 ) n vs. n behaviour in t he po ssible scenarios expected in the 
TeV energy domain are discussed in reference Q103 ]. The behaviour of t he va riable 
(z 2 ) n has also been used as one possible evidence for a phase-transition [105 ]. 



4.10 Are clans massive objects? 



The success of clan structure analysis in interpreting multiparticle production phe- 
nomenology raises, as pointed out at the end of section [4771 many intriguing ques- 
tions. Among them the problem of the clan mass is of particu lar i nterest. The first 
search on the subject goes back to A. Bialas and A. Szczerba [106 ] . The attempt is 
based on a generalisation of standard clan Poissonian production mechanism with 
two additional assumptions on clans distribution in rapidity variable and on the 
angular distribution of particle decay in a clan. Let us review briefly the main con- 
tent of this search. Clans, Poissonianly distributed and independently emitted in 
bremsstrahlung-like fashion, are characterised using energy and (longitudinal) mo- 
mentum conservation by the following single-clan (pseudo)-rapidity density 



dN w 



X 



+^A/ 



x 



(174) 



witha^ = (m T /\/s)e ±ri . 

Notice that A is the plateau height (the average number of clans per rapidity 
unit), m T the clan transverse mass (% = yjm 2 + p^), rj is the clan (pseudo)- 
rapidity and y/s the cm. energy (clan emission is limited to the interval [77] < 
hi{y/s / mr) . Furthermore particle probability density function inside a clan (as- 
sumed to be in 770) on the hypothesis of isotropic decay is given by 



2lo cosh 



2 (V-Vo 



UJ 



~i -1 



(175) 



where u is a free parameter; distribution amplitude is proportional to uj and u> = 1 
corresponds to an isotropic decay. 

By assuming next that particles will be produced in each clan according to a 



logarithmic MD, whose generating function is given by g loe 



1b(1-z/3)/1h(1- 



j3) = J2 n Pn° g)zTl an( l me average number of particles per clan by 







(176) 



(0 - 1) ln(l - p) ' 

with (3 = n/(n+k), one finds that the generating function for the MD in the interval 
Arj turns out to be 



G(X, A77) = exp 



dN 
drj 



In 



l-i^KwATy-Xz-i; 

ln(l - J) 



dr] 



(177) 
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with 

p(r} ,Ai])= ^(rj,rj )drj . (178) 

J At? 

p(rjo, A77) describes the fraction of particles generated by a clan with (pseudo)- 
rapidity r] falling in the interval A77. It is assumed in addition that emitted particles 
by each clan are independently produced in rapidity. With a convenient choice of the 
parameters (A = 0.855, m T = 3.15 (GeV/c 2 ), lu = 1.45, (3 = 0.90) UA5 data at 546 
GeV cm. energy are approximately reproduced by the generating function of the 
model G(z, A77). The corresponding n charged particle MD, P n , is not a NB (Pas- 
cal) MD except in full phase- space and forward backward multiplicity correlations 
are not correctly reproduced by the model. These considerations notwithstanding, 
the above results are in our opinion quite instructive, although not satisfactory, in 
that they show a possible way to determine clan width and mass. Accordingly, we 
decided to extend the search on clan masses performed in reference [ 1106 ] in the full 
sample of events of the collision to the individual components contributing to the 
collision itself. The idea is that, if clans are massive, clan masses are very probably 
different in the different classes of events in a given colli sion and again presumably 
they differ from one class of collisions to the other [1 07] . 

Let us examine pp collisions first, assuming, in agreement with the results of 
our search, that we have two classes of events: with and without mini-jets. This 
distinction is, as we know, not unique and at the present stage of our knowledge we 
should rely on reasonable guesswork in order to strengthen our intuition in regions 
where experimental data on a collision are lacking or not performed in terms of their 
components. 

Although it is not clear how much semi-hard component events contaminate 
the soft sample, the assumption that at 63 GeV one has only soft events is well 
supported by the fact that the shoulder effect in P n vs n at such cm. energy is 
negligible (one single NB (Pascal) MD describes experimental data quite well). A 
NB fit to the data at 63 GeV in 7] c < 2.5 is indeed correctly reproduced with the 
following choice of the parameters of the generating function Eq. ( 1177) : A = 1.14, 
m T = 1.80 (GeV/c 2 ), u = 0.84, (3 = 0.79. They are obtained by fitting with the 
least squares method the average multiplicity (n) and the quantity D 2 / (n) 2 — 1/(71) 
of the distribution given by the generating function G(X, A77) to the corresponding 
moments of the NB (Pascal) MD (i.e., n and k^ 1 ) in the pseudo-rapidity intervals 
A?7 = [-77c, 77J with rj c < 2.5. 

The same method is then applied separately to soft and semi-hard components 
which, according to our experience, control the dynamics in pp collisions at 900 
GeV cm. energy. Above four parameters are fitted in the two components by using 
available data at 900 GeV collected in terms of NB fits in pseudo-rapidity inter- 
vals 1 77 c I < 1, . . . , 5. The fits to the average charged multiplicity and dispersion as 
well as to the average number of clans and to the average number of particles per 
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Figure 43: Fit to the average multiplicity and dispersion in different pseudo-rapidity intervals 
[— T] c , rjc] for the two components of the MD in pp collisions at 900 GeV: soft component: open 
circles (data) and solid line (fit); semi-hard c ompo nent: filled circles (data) and dashed line (fit) 

dm. 



clan in different (pseudo)-rapidity intervals for the two components contributing to 
the total MD turns out to be quite good as shown in Figure \43\ Clan density and 
single-particle pseudo-rapidity probability density in a clan separately for the two 
components are also shown in Fig. 341 

Clan transverse masses and plateau heights at 900 GeV are much higher in the 
semi-hard than in the soft component (m T semi . hard = 3.43 (GeV/c 2 ) ^> m T soft = 1.47 
and A sem i-hard = 2.09 ^> A sott = 0.92) whereas the distribution width is much higher 
in the soft (1.95) than in the semi-hard (1.35) component. This fact shows that 
heavier particles are produced more in the semi-hard than in the soft component. In 
addition, the average number of particles per clan is bending in larger rapidity inter- 
vals in both components suggesting that clans are larger in central rapidity intervals 
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Figure 44: Clan density dN / dr\, and single particle pseudo-rapidity probability density in a clan 
(p(r); 0), for the soft (solid line) and semi-hard (dashed line) component at 900 GeV cm. energy 



than in the peripheral ones. Accordingly, leakage parameters in FB multiplicity cor- 
relations should be larger when clans have larger masses and their particle content is 
distributed in more central rapidity intervals. Indeed, in the present framework, it is 
possible to write an approximate expression of the leakage parameter p (controlling 
FB correlations, see section [479] ): 



p=l + 



10 



2Ar] 



In 



(1 + 



-2Aj7/w> 



(179) 



this formula was obtained summing $(r/, 770) over V an d averaging the result over 
?7o E Arj. The extreme value of p, 1 and 1/2, are obtained respectively for uj — > 
and uj — > 00. 

An interesting application of our approach to the two- and three-jets components 
at LEP cm. energy in e + e~ annihilation shows that clan transverse masses mx are 
larger in the three-jets sample of events (1.10 GeV/c 2 ) than in the two-jets sample of 
events (0.62 GeV/c 2 ) and that both masses are much lower that the masses expected 
at ISR energy in pp collisions. In conclusion clans could have masses which vary 
with cm. energy in a collision, they are different in the different components of a 
collision and vary from one class of collision to another one. The word is again to 
experiments which should clarify in addition to clan masses properties the existence 
of eventual other clan quantum numbers. 
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4.11 The reduction of the average number of clans and signals of new physics in 
full phase-space and in restricted rapidity intervals 

It has been shown in previous Sections that the weighted superposition mechanism 
of two classes of events in high energy collisions explains a series of experimental 
facts assuming that the n charged particle multiplicity distribution, for each class of 
events, is described in terms of a NB (Pascal) MD with characteristic parameters 
and ki (with i = soft, semi-hard). The experimental facts we refer to are those that 
were presented in Section 14771 and successfully described in the previous sections: 
a) the shoulder structure in the intermediate multiplicity range; b) the quasi oscil- 
latory behaviour of the ratio of factorial cumulants, K q , to factorial moments, F q , 
when plotted as a function of its order q (after an initial decrease towards a negative 
minimum at q rs 5); c) energy dependence of the strength of forward-backward 
multiplicity correlations. All these facts implied the relevance of NB (Pascal) regu- 
larity for classifying different classes of events and of its interpretation in terms of 
clan structure analysis. 

Our attention is focused here on clan behaviour for the semi-hard component 
in the two most realistic scenarios (according to the results of CDF Collaboration 
mentioned in Section [4781) , i.e., scenarios 2 and 3. Clans general behaviour at 900 
GeV and 14 TeV is summarised in the following table 





N (900 GeV) N (14 TeV) 


n c (900 GeV) n c (14 TeV) 


scenario 2 

k sh ~ (logv/s) -1 


23 11 


2.5 7 


scenario 3 

k S h ~ (\7logs) -1 


22 18 


2.6 5 



In going from the GeV to the TeV energy domain iV sem i.hard decreases and 
^c,semi-hard increases as requested by a clan aggregation process with higher parti- 
cle population per clan. 

Maximum clan aggregation would correspond of course to the reduction of 
-^semi-hard — > 1> i.e., remembering the definition 



-^semi-hard ^semi-hard 

In 1+ , (180) 

\ "^semi-hard / 

to the following relation between n sem i-hard and k ssmi . hm:d 

^■semi-hard = ^-semi-hard [6Xp(l//c sem j-hard) 1]? (181) 

which, being n semi . hMd > k semi . hMd , implies /c sem i-hard < 1. 

Natural questions are: is it that just an asymptotic property of the semi-hard 
component, or the characteristic property of an effective third class of events to be 
added to the soft and semi-hard ones? 
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Figure 45: n charged particle multiplicity distribution P n predicted for minimum bias events in full 
phase space by Pythia Monte Carlo (version 6.210, default parameters using model 4 with a double 
Gaussian matter distribution) at 14 TeV cm. energy, showing two shoulder structures | |l08 [|. 



Being the asymptotia of the semi-hard component corresponding to iV semi . haid — > 
1 a quite extreme cm. energy region in scenarios 2 and 3, completely outside the 
energy range available to future experiments, the first choice seems too far. On 
the contrary the suggestion to try an answer to the second question is of particular 
interest. In order to proceed along this line, let us anticipate at 14 TeV the new class 
of event with the above mentioned property dictated by semi -hard clan reduction 
(i.e., A; third < 1) [H]. 

It is interesting to remark that total n -particle MD, P^ otal , for the minimum bias 
event sample in full phase-space from Pythia version 6.210, (default parameters, 
with double Gaussian matter distribution, model 4), when plotted vs. multiplicity n, 
shows at 14 TeV a two-shoulders structure (Fig. 351) The second shoulder is similar 
although lower than the first one. In order to interpret the second shoulder one needs 
a third class of events: the second shoulder is the superposition of the events of the 
second class and those of the third class. 

Therefore we decided to explore in our scenarios the consequences of the exis- 
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Figure 46: Multiplicity distributions, in KNO form, with various values of k and the same average 
multiplicity n, show different curvatures | )l08j| . 



tence of a third class of events with A; third < 1 already at 14 TeV. 

The fact that k third is less than one has important consequences and fully charac- 
terises the properties of the new class : 

a. Two particle correlations are stronger in the event of the third class than in 
the event of the semi-hard component 

^thirdAthird = J C 2 (t]i, ri2)driidri2 > n 2 semi . hs[d / k semi . hard . (182) 

b. Since cumulants depend on 1 / fc tmrd which is much higher than 1 / fc serm -hard also 
cumulants of the third component are in general much larger than cumulants of the 
semi-hard component. 

c. Since iVt hird = 1 it follows that leakage parameter p third = 1/2, i.e., it reaches 
its maximum value, being also 6 third rs 1 from 

26 tmrd p t hi rd ,.„„, 

FB,third — Z 7\ \: (io^J 

1 — ^OthirdPthirdU — PthirdJ 

one has that 6 FB> third — > 1 and that also forward backward multiplicity correlation 
strength of the third class of events is larger than that of the semi-hard component. 
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Figure 47: n charged particle multiplicity distribution P n expected at 14 TeV in full phase-space in 
presence of a third (maybe hard) component with iV t hird = 1, showing one shoulder structure and 
one 'elbow' structure. The band illustrates the range of values of parameters n t hird, fcthird and a t hi r d 

discussed in the text | |l08 [|. 

It should be remembered that the new class of events is described by a NB 
(Pascal) MD with n third 3> k third and that A; third <C 1, i.e., a log-convex gamma MD, 
well approximated for fc t hi r d — > by a logarithmic distribution (Fig. \46[ ) 

It is assumed therefore that the classes of events contributing to the total sample 
at 14 TeV are the following: 



Class I: SOFT events (NO mini-jets). iV soft is here large and growing with 
cm. energy and n CjSO ft quite small; P„ iSO f t obeys KNO scaling and A; soft is constant. 



Class II: semi-hard events (with mini-jets). N semi . h!ird is decreasing with 
cm. energy whereas n CiSem i-hard is increasing and it larger than n CiSO f t ; KNO scaling 
is violated and k semi . hard decreases. 
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Figure 48: Multiplicity distribution in < 0.9 for the scenario described in the text (solid line); 
the three components are also shown: soft (dashed line), semi-hard (dash-dotted line) and the third 

(dotted line) ifioi . 



Class III: the benchmark of the new class is k TmRD < 1. Being A^ third 
(reduced to few units and) approximately equal to 1 , quite large forward backward 
multiplicity correlations are expected and — according to the results of section 14.91 
— a leakage parameter from one hemisphere to the opposite one close to its max- 
imum. The fact that hadronic clans are very few with a high particle density per 
clan should have consequences also at parton level where one should find a huge 
colour exchange process from a relatively small number of high virtuality ancestors 
presumably controlled by a mechanism harder than in the components of the other 
two classes. 

Generalising previous results we expect that P^ tal be obtained as the weighted 
superposition of the above mentioned three classes of events each described by a 
NB (Pascal) MD with characteristic parameters fii and ki (i = soft, semi-hard, third) 
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i.e. 



pr al = « S o ft pr ft (^oft,^ 



'soft ) 

Csemi-hard-fn (^semi-hard; ^semi-hard) (184) 



■ Q;third-Pn hlrd ( n third, &thiid) 



with asoft + asemi-hard + «thw = 1 and a soft , a sem i-hard and a third the weight factors of 
each class of events with respect to the total sample (see Fig. |47j ) 

P™ al has a characteristic elbow structure for large n and a narrow peak for n 
close to zero. Both trends are consequences of the log-convex (/c m ird < 1) gamma 
shape of the nthka -charged multiplicity distribution of the new component which 
shows a high peak at very low multiplicities and a very slow decrease for large 
ones. The contrast with Pythia Monte Carlo calculations predictions is remarkable 
as can be seen just by inspection of Figures [45] and WT[ 

The search on global properties can be extended from full phase- space to ra- 
pidity intervals. Since Tevatron data seem to favour, among our scenarios, the one 
based on strong KNO-scaling violation (i.e., scenario 2 with k"^ growing linearly 
in Ins), we decided to discuss in the rapidity interval \r)\ < 0.9 (which will be 
available at LHC with Alice detector) this scenario only. The weight factor of each 
component has been taken to be in the rapidity interval considered the same as in 
full phase space. Two extreme situations has been allowed for the third component 

(i) the third component is distributed uniformly over the whole phase space 

(ii) the third comp onent has a very narrow plateau and falls entirely within the 



interval \rj\ < 0.9 (see [108 ] for details.) 

Results are shown as a band in Fig. 581 It should be pointed out that the general 
trend of P^ tal (r] c , y/s) vs. n is quite similar to that already seen in full phase-space. 
Notice that the narrow peak at very low multiplicity (again due to the third com- 
ponent) is here hidden by the standard peaks of the soft and semi-hard components 
which are shifted to lower multiplicities than in full phase- space. In order to make 
the comparison easier, in the following Table are given the parameters for the extrap- 
olated component multiplicity distributions at 14 TeV in full phase-space, assuming 
that events of the third class comprise 2% of the total: 



FPS 


% n k N n c 


soft 
semi-hard 
third 


41 40 7 13.3 3.0 
57 87 3.7 11.8 7.4 
2 460 0.1212 1 460 



and the parameters for the extrapolated component multiplicity distributions at 14 
TeV in the pseudo -rapidity interval \rj\ < 0.9: 
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\r]\ < 0.9 


% n k N n c 


soft 
semi-hard 
third (i) 
third (ii) 


41 4.9 3.4 3.0 1.6 
57 14 2.0 4.2 3.4 
2 40 0.056 0.368 109 
2 460 0.1212 1 460 



(I) EVENTS EVENLY SPREAD OVER THE WHOLE RAPIDITY RANGE : in this case 

only 37% of the clan is contained within the pseudo rapidity interval \r]\ < 0.9, fctwid 
is even much less than 1 and n c t j,i r d ^40. 

(II) event CONCENTRATED IN \r]\ < 0.9 : the single clan is fully contained in 
\r)\ < 0.9, its parameters are of course the same as those in full phase-space, but 
particle density in rapidity is much higher than in (i). 

The average number of particles per clan both in full phase-space and in the 
rapidity interval is much larger in the semi-hard than in he soft component (semi- 
hard clans are larger than soft clans.) 

Coming to forward-backward multiplicity correlations, the overall strength for 
three components is given by 

bjDj i ^3 v^ 3 (- _ ri \' 2 

, Li=l °i l+f,, + 2 ^j=l Lj>! n j) ,..„ 

FB = 3 3 3 ' (185) 

Si=l a iT+b~ + 2 Si=l ^j>i a i a j{^i ~ ^j) 2 

By taking the quite reasonable assumption that the leakage parameter in \rj\ < 
0.9 is the same as in full phase-space, it has been found: 





FPS 


\t]\ < 0.9 




first/soft 


0.41 


0.25 




second/semi-hard 


0.51 


0.45 




third 


0.9995 


0.997 


(i) 






0.9995 


(ii) 


total (weighted) 


0.98 


0.92 





Notice that &FB,semi-hard is always larger than 6 F B,soft- Then 6 F B,thhd tends to saturate 
in all cases its maximum which is 1 . 

The total FB multiplicity correlation strength resulting from the weighted su- 
perposition of the contributions of all classes of events is larger in FPS than in the 
rapidity interval but closer to its asymptotic value. As already pointed out, stronger 
FB correlations at hadron level suggest stronger colour exchange process at parton 
level and this effect is clearly enhanced in and by the third component. 
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Clan reduction is an important phenomenon which has already observed in 
proton-nucleus collisions (see Section |4.3[ ). The attempt is to explore a possible 
link and to compare predictions in pp collisions with those on nucleus-nucleu s col - 
lisions in terms of the energy density variable, e, by using Bjorken formula [1 109 ] 



_ 3 (Et) dn 
£ ~ 2~V~ dy 



(186) 

where (E T ) is the average transverse energy per particle, V the collision volume 
and dn/dy the particle density at mid-rapidity. The volume has been estimated with 
proton radius « 1 fm and formation time r ~ 1 fm. Lacking general expectations 
for (Et), for the soft component the value measured at ISR and for the other com- 
ponents the values measured by CDF (which should be intended as a lower bound 
leading to lower bounds for energy densities as well) has been used. The results are 
summarised in the following Table: 



our scenarios 


soft semi-hard (i) third (ii) 


(i) total (ii) 


dn/dy 
(E T ) (MeV) 
e (GeV/fm 3 ) 


2.5 7 20 230 
350 500 500 500 
0.4 1.6 4.7 54 


10.8 19.2 
500 500 
2.5 4.5 



The energy density for the semi-hard component in our scenario at 14 TeV is of 
the same order of magnitude found at AGS at 5.6 GeV in O+Cu collisions (e « 1.7). 
The energy density for the third component in the spread out scenario is comparable 
with its value recently measured at RHIC in Au+Au collisions (e ~ 4.6) The energy 
density for the third component in the other extreme scenario with high concentra- 
tion is « 54, even larger, being dn/dy much larger, than the LHC expectations for 
Pb+Pb collisions (e « 15) 

Of course the above estimate are only indicative, in view of the application of 
Bjorken formula to pp collisions, as well as for the choice of the values of the param- 
eters. These considerations notwithstanding, it is possible that some characteristic 
behaviour of observables seen at RHIC in AA collisions could be reproduced at 
LHC in pp collisions. 
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5 CONCLUSIONS 



Clan concept has been originally introduced in multiparticle phenomenology in or- 
der to interpret approximate regularities observed in n charged particle MD's in all 
classes of collisions. The occurrence of the same regularity at parton level as a 
QCD Markov branching process in the solution of KUV differential equations in 
LLA with a fixed cut-off regularization prescription led to consider partonic clans 
as independent intermediate gluon sources similar to bremsstrahlung gluon jets. 

The sudden violation of the regularity at higher cm. energies and in larger ra- 
pidity intervals in pp collisions and in e + e~ annihilation, with the appearance of a 
shoulder structure in n charged particle MD's, opened a new horizon in the field and 
suggested a more accurate search at a deeper level of investigation. The attention 
on experimental data analysis moved from the full sample of events to the separate 
samples of events of the single components or substructures of the collisions. It 
has been an important discovery to find that the same regularity violated in the total 
sample of events was satisfied in the single components, each described by charac- 
teristic and in general different parameters. Their weighted superposition provided 
a satisfactory understanding of collective variables and correlations behaviours in 
multiparticle production in the examined classes of collisions where experimental 
data were available. 

Accordingly, clan structure analysis, in view also of its suggestive QCD roots, 
became an interesting tool in multiparticle dynamics. Furthermore clan existence 
itself was demanded by a convincing description of forward-backward multiplicity 
correlations cm. energy dependence in pp collisions. 

The suppression in the same reaction of the average number of clans of the semi- 
hard component in the TeV energy region in the QCD inspired scenario as well as 
in the scenario with strong KNO scaling violation (both obtained by extrapolating 
our knowledge on data in the GeV energy domain), together with the huge increase 
of the average number of particle per clan, was surprising. It suggested the onset of 
a new scenario in the TeV energy domain characterised by the appearance of a third 
class of presumably quite hard events (to be added to the soft and semi-hard ones) 
with few clans, maybe just one, with the characteristics of high particle density 
fireballs. The possible anticipation of this new class of events at 14 TeV would be 
easily experimentally detectable and its finding would shed some light also on the 
link of pp collisions with heavy ion collisions, where fireballs are expected to be 
much larger. 

Along the years, clans as groups of particles exhausting all correlations inside 
each clan and generated by independent intermediate particle sources (the clan an- 
cestors) revealed additional properties which helped to make their physical nature 
more precise, up to the extreme thought that clans might be real physical objects 
controlling the hadronization process and whose partonic partners might be QCD 
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parton showers originated by intermediate independent gluon sources. 

Clan concept turns out to be in this perspective more general than jet concept, 
in that no kinematical cuts are needed for its definition, and it is also not a purely 
statistical concept like a standard cluster in cluster expansion in statistical mechan- 
ics (as thought at the time of its introduction) in that we learned that a mass can 
be attributed to clans and that these masses could be quite different in each com- 
ponent and vary from one collision to another one. Assuming that clan existence 
would be also verified experimentally, a new intriguing question would possibly 
concern other quantum numbers of these intermediate massive objects occurring in 
the production process. 

In conclusion, it is not clear how far we went in clarifying the enigma of mul- 
tiparticle dynamics evoked in the introduction to the present paper. This was not 
indeed our main goal. We wanted to point out how successful and inspiring has 
been in our search the continuous dialog between theory and experiment and to fo- 
cus the attention of the reader on the development of a series of experimental facts 
and theoretical ideas which might, hopefully, transform an enigma in an Arianna 
thread in the labyrinth of multiparticle dynamics in its awkward journey toward 
QCD and open new perspectives in pp and heavy ion collisions in the TeV energy 
domain. 
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Appendix A Physical and mathematical properties of the NB (Pascal) MD 

Cornerstone of the present approach to multiparticle dynamics has been the search 
for regularities of collective variables behaviour in the components or substructures 
of the various collisions. In view of the relevance in this framework of the NB (Pas- 
cal) MD, after some historical notes on its origin and its use, its main mathematical 
and physical properties, with particular emphasis on clan structure analysis and par- 
ticle shower development, are summarised for the benefit of the reader. All these 
properties have been used indeed in the present paper in various occasions and are 
at the basis of the interpretation of the NB (Pascal) regularity itself. 



A. 1 Historical notes 

A few remarks will help to illustrate the main motivations of the wide use of the 
negative binomial (Pascal) multiplicity distribution (NB (Pascal) MD) and in more 
general terms of the class of compound Poisson multiplicity distributions in science. 

The NB (Pascal) MD with integer parameter k was known already to Blaise 
Pascal pi 10 ] (that is the reason why it was decided with P. Carruthers J ill] few 
months before his passing, to add the name 'Pascal' to the distribution so extensively 
used in multiparticle dynamics by of all of us.) The distribu tion appeared fifty years 



later in the Volume Essay d' analyse sur le jeux de hazard [112]. The front page of 
the Volume points out that the Volume has been published with the permission of the 
king Louis XIV and describes the negative binomial as the probability distribution 
of the number of tosses of a coin necessary to get a fixed number of heads [45]. 
Gambling was indeed the favourite game of the nobility in that century and the 
discovery of its statistical rules a real achievement. 

The NB (Pascal) MD is used in modern times in many fields (biology, econo- 
metrics, medicine. . . ). Its first applications to the spreading of a disease in terms of 
sickness proneness of various g roup s of individuals goes back to 1920 and is due to 



M. Greenwood and G.U. Yule [113] 



The distribution appears almost in the same years in quantum statistical me- 
chanics in order to describe n Bose particle multip licit y distribution in k identical 



systems, each containing on average n/k particles [114] . 

The distribution becomes later of fundamental importance in quantum optics in 
the study of n -photon MD from a partially coherent source [ 1 1 15] ; it is remarkable 
that the photon distribution is Poissonian when the source is coherent (k = oo) and 
it reduces to a NB (Pascal) MD with k — 1 (geometric, i.e., Bose-Einstein) when 
the source is thermal. 

After its introduction in high energy collisions in the accelerator region in 1972, 
the NB (Pascal) MD became quickly — as already pointed out — a stimulating phe- 
nomenological tool for describing multiparticle production general trends in all 
classes of collisions. 
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Many are the reasons of this success. 

Firstly, the NB (Pascal) MD is a compound Poisson distribution and represents 
one of the the simplest two-parameter MD correcting the independent n-particle 
production process suggested for instance by the multi-peripheral model predic- 
tion, i.e., a Poisson multiplicity distribution in the average number of particles n. 
The generating function of the NB (Pascal) MD can indeed be written as an expo- 
nential of a logarithmic distribution with an intriguing positive numerical factor in 
front whose interpretation in terms of clan concept is one of the main subject of 
this review. The reduction of the average population within each clan to one unit, 
i.e., the equality of the average number of particle of the full distribution with the 
average number of clans, leads to the Poisson MD of the full n-particle distribu- 
tion. Deviations from unity in the average number of particles per clan represent 
the correction to the Poissonian behaviour (see Section [273] ) 

Secondly, the NB (Pascal) MD contains a set of multiplicity distributions in 
correspondence to different limiting values of its standard parameters, as shown in 
Section IA.31 below. 

Thirdly, the NB (Pascal) MD is a hierarchical distribution, i.e., all multiplicity 
correlations can be expressed in terms of second order moments, controlling two- 
particle correlations (see Section [273] ) 

Fourth, NB (Pascal) MD's as solutions of the differential QCD evolution equa- 
tion in LLA can be interpreted as Markov branching processes of two (parton) pop- 
ulations evolving with different strengths (corresponding to the dominant QCD ver- 
tices) down to the final (parton) configuration (see Section 13.11 ) 

A. 2 Different parametrisations of the distribution used in multiparticle dynamics 

A. 2. A The standard parametrisation of n charged particle multiplicity distribu- 
tion, P n , is usually given in terms of the average charged multiplicity, n, and 
the positive parameter k, which is linked to the dispersion D of the distribution 
(D = \/ n 2 — n 2 ) by the relation k = n 2 / (D 2 — n). D 2 is therefore larger than n. 
Accordingly 



fc(A; + l)...(A; + ra- 1) 
n! 



n 



P n (n, k) = P (n, k) 



n 



(187) 



n + k 



with 




(188) 



The corresponding generating function is 



oo 




G{n 



,k;z) = ^P n (n, £;) 



(189) 



n=0 
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with G(n, k;l) = l as normalisation condition. 



A.2.B The distribution has been also defined in terms of the coefficients a and b 
of a linear recurrence relation between P n+1 and P n multiplicities, i.e., 

(n + l) %^ffi = a + 6n, (190) 
P n {a,b) 

and P n (n, k) from above becomes 

, ,s , ^a(a + b)...(a + b(n — 1)) 
P n (a,b) = PoM ,, n (19D 

with 

P (a,6) = (l-6)°/ 6 . (192) 
The corresponding generating function is 

oo / i i \ a/6 

G(a,M) = ^P„(a,&);z n = (— -J . (193) 

n=0 ^ ^ ' 

Notice that 

A; = a/6 and n = o/(l-6). (194) 

This definition is particularly suitable in an approach to multiparticle dynamics 
based on stimulated emission where the parameter b/a corresponds to the fraction 
of particles already present stimulating the emissions of an additional one [66]. 

A.2.C Being the NB (Pascal) MD a compound Poisson distribution, it can be 
written by using another set of parameters, i.e., the parameters of clan structure 
analysis N (the average number of clans) and n c (the average number of particle 
per clan, n c = n/N) and the corresponding generating function turns out to be 

G(N, n c - z) = exp[iV(G log (n c ; z) - 1)] (195) 

with 

— / 77 \ CI 

N = k\n[l + -) =-■ -W1-6) (196) 
V kJ b 

and 

n —b 
Uc = fcln(l + n/fc) = (1-b) ln(l -b)' (197) 

G\ og (n c ; z) is the logarithmic multiplicity generating function and is equal to ln(l — 
bz)/ ln(l — b). In this framework, 1/k is an aggregation parameter, i.e., the ratio 
between the probability that two particle belong to the same clan to the probability 
that they belong to different clans [166] . 
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A.2.D Notice that Gi og (n c ; 0) = and that exp(-iV) = G(n, k; 0). It follows that 
the zero particle probability is given by 

exp(-TV) = P (n, k) = ( = (1 - b) a ' b . (198) 



k + n / 

This result can be extended to any interval, Aw, of a generic variable w, (e.g., 
w could be (pseudo)-rapidity or transverse momentum or coordinate in the real 
space). In this case the probability to find zero particles in Aw, Po(n(Aw), k(Aw)), 
is defined and an intriguing connection with clan structure parameters in Aw is 
established, i.e., 

N(n(Aw),k(Aw)) = -\nP (n{Aw), k{Aw)) (199) 

and 

n c {n{Aw), k{Aw)) = — - (200) 

Vo{Aw) 

Vq being the void function of Section 12.31 

Of particular interest is indeed the following theorem: factorial cumulant struc- 
ture is hierarchical, i.e., n-order factorial cumulants are controlled by second order 
factorial cumulants, iff the function Vo(Aw) scales with energy and Aw as a func- 
tion of n(Aw) multiplied by the second order factorial cumulant. Being for the NB 
(Pascal) MD the second order factorial cumulant equal to 1/ k(Aw), it follows that 

fn(Aw)\ 1 
\k{Aw) J Vq(Aw) 

i.e., the distribution is hierarchical, as K n = f(K 2 ). 

The use of Vq(Aw) and Po(n(Aw), k(Aw)) is of particular interest in the study 
of rapidity gaps in the various collisions [54[] . 

A.2.E The NB (Pascal) MD can be obtained also by the weighted superposition 
of Poissonian MD's with average multiplicity fi\ = \n and gamma weight 

f(X,k) = ^ k)k ~ 1 exp(-kX). (202) 
(fc- 1)! 



It follows 



(A™)\-nA 



Pn(n,k)= / ^-^e- nX f(k,X)d\ 

k \ k k(k + l)...(k + n-l) ( n 



n + k J n\ \n + k y 

This representation of the distribution has been used in Section \A2\ in order to de- 
scribe corrections to the multi-p eriph eral model predictions. It is known in quantum 



optics as "Mandel's equation" [ 1115 ] and in mathematics as "Poisson transform" 
with gamma weight [761] . 
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A.2.F The convolution of two NB (Pascal) MD's with different parame ter k but 
the same parameter b, P n (k, b) and P n >(k', b') is again a NB (Pascal) MD [ 1 16] . In 
fact 

£ p n (k, b)PAk', b) = (i - ( ^,| " " 

n+t^iv JV!(* + *'-l)! (204) 

= P N (k + k',b). 

A3 The one-dimensional limits of the parameters of the distribution 
A. 3. A The limit b — > of P n (a, 6) leads to a Poissonian MD: 

lim P n (a, 6) = P(A-, (205) 
6->o n\ 

with P = e~ a and in this limit a = n. i.e. 

- n n 

P n (n) = e~ n —. (206) 
n\ 

The corresponding generating function (GF) is 

G{a;z) =exp[a(z- 1)]. (207) 
Notice that the limit 6 — > corresponds to a — > n and fc — > oo. 

A.3.B The limit a — > 0, with constant b, leads to the logarithmic MD: 

limP n (a,6) = P 1 (b) — , (208) 

with Pi (6) = —6/ ln(l — 6) (n is always > 1). Therefore 

p n {b) = -6 n /nln(l - b). (209) 
Notice that this limit corresponds to k — >• 0, with constant 6. 

A.3.C The limit a = b, i.e., k = 1, leads to the geometric MD: 

P n (b,b) = P Q (b)b n , (210) 

with Po(6) = (1 — b), and therefore 

P n (b) = (l-b)b n . (211) 

The average multiplicity of the geometric distribution is 6/(1 — b) and the corre- 
sponding GF: 

G(b;z) = (1 -&)/(! -&*). (212) 
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A.3.D For n 3> k one gets the gamma distribution 

1 k k 

lim PJn, k) = -——(n/nf- 1 exp(-Wn). (213) 
n>fc nT(k) 

The distribution is concave for k > 1 and convex for k < 1. It becomes an expo- 
nential for A; = 1. Notice that D 2 /n 2 is approximately equal to 1/k a nd th at the 



maximum value of the gamma MD is obtained for n/n = 1 — 1/ k, see [108[ | 



A.3.E The (positive) binomial limit: For a > and 6 < 0, and a/6 an integer 
number, the negative binomial MD becomes a standard binomial MD and —k (now 
positive) is the maximum multiplicity. — 1/k can be seen as an anti-aggregation 
parameter, in that particles like to stay far apart from each other. 

A. 4 Physical informations contained in the differential of the generating function 
of the multiplicity distribution with respect to its parameters considered as 
independent variables 

Let us consider the differential of the NB (Pascal) MD GF, 

1 - b ^ a/b 



G(a,b;z) = I — -j , (214) 

with respect to its independent variables after the introduction of v = 1/(1 — 6) and 
remembering that k = a/b, i.e., the differential of 



G(a,b;z) = — , (215) 

\ V + z — vz J 

with respect to z, v and k variables, it follows 

dlnG(a, b; z) = - — ^——dz + lnG(a, b; z)^— + k [G geom (u; z) — 1] -, (216) 

where Ggeoml^; z) = (u + z — vz)~ l is the GF of the geometric MD. 

A. 4. A By taking k and b (v) constant, the above equation becomes 

d 

(l-bz)—G(a,b;z)=aG(a,b;z). (217) 
oz 

The new equation is the differential form in the generating function language of the 
linear relation between n + 1 and n particle MD's, Eq. ( |190| ). 
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A.4.B For z and b (y) constant one obtains that In G(n, k; z) is proportional to k: 

lnG{n,k;z) = klnG(n/k,l; z) (218) 

The new equation reminds us in the GF language that the corresponding MD is a 
CPMD. 

A.4.C By taking z and k constant and integrating the differential between 1 and 
v one has 

r dv' 
In G(n, k-z) = k / [G(v', z)-l]—. (219) 
Ji v 

Notice that for b — (v — 1) the GF is equal to one. This result is strictly related to 
the clan structure expressed by the relation 

G{n, k; z) = G Poisson (JV; G log (b; z)) , (220) 

with 

N = k\nv (221) 

and 

G log (b; ^) = ^J i G geom (u'; z)dv'lv'. (222) 

The NB (Pascal) MD is therefore generated by independent emission of geometric 
clans with mean multiplicity v' in the interval (1, v) and the average number of 
geometric clans in (z/ , v' + dv') is kdv' jv' . 

It should be pointed out that the GF of the geometric MD, G geom (u; z), obeys 
the differential equation 

d_ 

dv 

In terms of probabilities, it describes a typical self-similar branching process as that 
found in the Markov process version of the KUV model of gluon shower when the 
g — > qq branching is neglected, as discussed in Section [3T[ 

A. 5 Informations contained in the differential of the logarithmic MD GF 
Let us now consider the GF of the logarithmic MD: 

<^w = w$- (224) 

A simple calculation shows that 

n mi ti h ,dG log {b-z) . . (l~b)z 

" (1 " b) ln(1 " h) ^T~ = ~ G ^ b] Z) + T^bz- (225) 

— —G\ og (b] z) + zG geom (b; z), 



v—. G geom(v, z) = G geom (v; z)(G geom (v\ z) - 1). (223) 
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or by using the variable v = 1/(1 — b): 



In v = -Gf tog (6; z) + zG geom (b; z). (226) 

For an infinitesimal change, the MD of an average clan, which is logarithmic, 
evolves by addition of a geometric distribution. The first term in the equation is en- 
suring the normalisation condition G\ og (b; 1) = 1. Accordingly, the logarithmic clan 
can be taken as an average over geometric clans as shown explicitly in Eq. (|222[ ). 
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